LNCS 





Springer 




Lecture Notes in Computer Science 1663 

Edited by G. Goos, J. Hartmanis and J.van Leeuwen 




springer 

Berlin 

Heidelberg 

New York 

Barcelona 

Hong Kong 

London 

Milan 

Paris 

Singapore 

Tokyo 




Frank Dehne Arvind Gupta 
Jorg-Riidiger Sack Roberto Tamassia (Eds.) 



Algorithms and 
Data Structures 



6th International Workshop, WADS ’99 
Vancouver, Canada, August 11-14, 1999 
Proceedings 




Springer 




Series Editors 

Gerhard Goos, Karlsruhe University, Germany 
Juris Hartmanis, Cornell University, NY, USA 
Jan van Leeuwen, Utrecht University, The Netherlands 



Volume Editors 

Frank Dehne 
Jorg-Rudiger Sack 

Carleton University, School of Computer Science 
1 125 Colonel By Drive, Ottawa, Canada KIS 5B6 
E-mail: {dehne,sack} @scs.carleton.ca 

Arvind Gupta 

Simon Fraser University, School of Computing Science 
Burnaby, BC, Canada V5 A 1S6 
E-mail:arvind@cs.sfu.ca 

Roberto Tamassia 

Brown University, Center for Geometric Computing 
Providence, RI 02912-1910, USA 
E-mail: rt@cs.brown.edu 

Cataloging-in-Publication data applied for 

Die Deutsche Bibliothek - CIP-Einheitsaufnahme 

Algorithms and data structures : 6th international workshop ; 
proceedings / WADS ’99, Vancouver, Canada, August 11 - 14, 1999. 
Frank Dehne ... (ed.). - Berlin ; Heidelberg ; New York ; Barcelona ; 
Hong Kong ; London ; Milan ; Paris ; Singapore ; Tokyo : Springer, 
1999 

(Lecture notes in computer science ; Vol. 1663) 

ISBN 3-540-66279-0 



CR Subject Classification (1998): F.2, E.l, G.2, 1.3.5, H.3.3 
ISSN 0302-9743 

ISBN 3-540-66279-0 Springer- Verlag Berlin Heidelberg New York 



This work is subject to copyright. All rights are reserved, whether the whole or part of the material is 
concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, 
reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication 
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, 
in its current version, and permission for use must always be obtained from Springer- Verlag. Violations are 
liable for prosecution under the German Copyright Law. 

© Springer- Verlag Berlin Heidelberg 1999 



Typesetting: Camera-ready by author 

SPIN: 10704216 06/3142 - 5 4 3 2 1 0 Printed on acid-free paper 




Preface 



The papers in this volume were presented at the Sixth Workshop on Algorithms and 
Data Structures (WADS ’99). The workshop took place August 11 - 14, 1999, in 
Vancouver, Canada. The workshop alternates with the Scandinavian Workshop on 
Algorithms Theory (SWAT), continuing the tradition of SWAT and WADS starting 
with SWAT’88 and WADS’89. 

In response to the program committee’s call for papers, 71 papers were 
submitted. From these submissions, the program committee selected 32 papers for 
presentation at the workshop. In addition to these submitted papers, the program 
committee invited the following researchers to give plenary lectures at the workshop: C. 
Leiserson, N. Magnenat-Thalmann, M. Snir, U. Vazarani, and J. Vitter. 

On behalf of the program committee, we would like to express our appreciation 
to the six plenary lecturers who accepted our invitation to speak, to all the authors who 
submitted papers to WADS’99, and to the Pacific Institute for Mathematical Sciences 
for their sponsorship. Finally, we would like to express our gratitude to all the people 
who reviewed papers at the request of the program committee. 

August 1999 F. Dehne 

A. Gupta 
J.-R. Sack 
R. Tamassia 
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Optimization over k-set Polytopes and Efficient 
k-set Enumeration 



Artur Andrzej ak^ and Komei Pukuda^ 

^ Institute of Theoretical Computer Science, ETH Zurich, CH-8092 Zurich, 

Switzerland, 
arturQinf . ethz . ch 
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^ Institute for Operations Research, ETH Zurich, CH-8092 Zurich, Switzerland, 
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WWW home page: http://www.ifor.math.ethz.ch/staff/fukuda/fiikuda.htinl 



Abstract. We present two versions of an algorithm based on the reverse 
search technique for enumerating all A:-sets of a point set in . The key 
elements include the notion of a fc-set polytope and the optimization of 
a linear function over a k-set polytope. In addition, we obtain several 
results related to the fc-set polytopes. Among others, we show that the 
1-skeleton of a k-set polytope restricted to vertices corresponding to the 
affine k-sets is not always connected. 



1 Introduction 

Let S' be a set of n points in A k-set 0 / S is a set P of A: points in S 
that can be separated from S\P by a hyperplane. The problem of enumerating 
the k-sets has many applications in computational geometry ([AW97a]), among 
others in computation of higher-order Voronoi diagrams ([Mul93]), in orthogonal 
Li hyperplane fitting ([KM93]) and in halfspace range searching ([AM95]). The 
first output-sensitive algorithm for enumerating k-sets was given in [EW86] (for 
M^), and other such algorithms appeared in [Mul91,AM95]. 

While the above algorithms concentrate on time-efficiency and require sophis- 
ticated data structures, we present here two output-sensitive algorithms which 
are highly memory-efficient. They are based on the reverse search technique 
described in [AF96]. Except for being memory-efficient, the reverse search al- 
gorithms are easy to implement since they do not depend on complicated data 
structures. In addition, reverse search allows parallel computation with high 
speed-up factors. These advantages make it possible to handle problem instances 
where other methods fail, as exhaustive search computations are frequently lim- 
ited by memory requirements, and less by time resources. 

While developing these algorithms we obtain several results related to the k- 
set polytopes introduced in [EVW97,AW97b]. For S as above and A: E {1, . . . , n- 
1}, the k-set polytope Qk{S) is the convex hull of the set 

Xk(S) = {'£p\T€ (^)}. 
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The most important property of these polytopes is the fact that there is a bijec- 
tion between the A:-sets of S and the vertices of Qk{S). In Section 2 we describe 
some properties of Qk{S) and prove that the diameter of Qk{S) is at most ( 2 ). 

For the first algorithm for enumerating A:-sets we show in Section 3 how to 
optimize a linear function over Qk{S) and how to find a unique path from a 
vertex of Qk(S) to a unique optimal vertex. The key notion for this task is that 
of polytopal neighbors in Qk{S), i.e. two vertices of Qk{S) connected by an edge. 
We discuss how to find all polytopal neighbors of a given vertex of Qk(S) in an 
output-sensitive fashion by applying geometric duality and linear programming. 

The second algorithm for enumerating k-sets is obtained in Section 4 by con- 
sidering combinatorial neighbors of Qk{S). Two k-sets Ti, T 2 (or corresponding 
vertices of Qk{S)) are called combinatorial neighbors if |Ti fl T 2 I = A: — 1. While 
two polytopal neighbors are also combinatorial neighbors, we prove that the con- 
verse is not true. We give a method to determine all combinatorial neighbors of 
a given A:-set which is simpler than the algorithm for polytopal neighbors. 

In Section 5 we discuss affine A;-sets. An affine k-set 0 / 5 is a set P of k 
points in S such that there is a hyperplane which has the points in P on one 
side and the points in S\P together with the origin on the other side. The afSne 
fc-sets correspond to certain cells in an arrangement of hyperplanes obtained by 
geometric duality from S. They can be used to solve the problem of finding the 
maximum feasible subsystem of linear relations (Max FLS, see [AK95]). We 
show that in the 1-skeleton of Qk{S) restricted to vertices corresponding to 
the affine A;-sets is not always connected. As a consequence, obtaining an output- 
sensitive algorithm for enumerating the affine fc-sets remains an open question. 

The following definitions are used throughout this paper. Let S denote a set 
of n points. We always assume that S is in general position, i.e. no ^ -h 1 points 
are on a common (i — l)-flat for i = 1, . . . , d. If /i is an oriented hyperplane in 
E^, then denotes the (open) halfspace on the positive side of h and h~ the 
(open) halfspace on the negative side of h. We say that a hyperplane h separates 
the sets of points A and B, if the points in A are contained in one side of h 
and the points in B are contained in the other side of h. (Thus our notion of 
separation is what is sometimes called strong separation by a hyperplane). For 
two vectors v and w, their inner product is denoted by v • w. We denote by 
lp(Z,m) the time needed for solving a linear inequality system with I variables 
and m inequalities. 

2 The fc-set polytope 

2.1 Basic properties 

Assume that k £ {1, ... ,n — 1}. For T £ (^) we denote by v(T) the sum 
^peTP element of Xk{S)) and for v £ Xk{S) we denote by T(t;) the unique 
set T £ (^) such that Y^p^rpp = v. An oriented hyperplane h is called an (i,j)- 
plane of S if h contains exactly i points of S and has exactly j points of S on 
its positive side. 

The following theorem is given in [AW97b] in a slightly different form. 




3 



Theorem 1. (a) For k £ {1,.. . ,n — l},T £ (^) is a k-set of S if and only if 
v(T) is a vertex of Qk{S). 

(b) Let h be an {jij 2 )-pld'n>e of S with normal vector c, say, for some integers 

ji ^ 1; J 2 > 0. Then for each js = 1,... ,ji there is a hyperplane h\ 
with normal vector c which contains a face F of exactly (^.^) 

vertices. (We say that h induces F in Qj 2 +js{S))‘ 

Furthermore, the vertices of F are exactly the points 

v(Ti U (5 n /i+)) 

for allT, e 

(c) For A: € {1, ... ,n — 1} and a face F of Qk(S), there is a {ji, j‘ 2 )-plo>ne of S 
which induces F for som,e j\ > 1 and j ‘2 > 0. 

Proof: Omitted. 

We call two vertices of Qk{S) polytopal neighbors if they are connected by 
an edge in Qk{S). 

Corollary 1. Assum,e that k £ {1, . . . ,n - 1} and Ti,T 2 £ Ti / T 2 . 

(a) The points v(Ti) and v(T 2 ) are polytopal neighbors if and only if there is a 
(2, k — l)-plane h with /i“^ D 5 = Ti fl T 2 . 

(b) If the points v(Ti) and v(T 2 ) are polytopal neighbors, then \Ti flT 2 | = A: - 
and Ti U T 2 is a (k + l)-set. 

(c) A vertex vi of Qk{S) has at w,ost k{n — k) polytopal neighbors. 

Proof: Omitted. 

2.2 The diameter of the fe-set poly tope 

If two vertices v(Ti), v(T 2 ) are polytopal neighbors in Qk{S), then by Corollary 
1 we have 

v(Ti) - v(T 2 ) = Pi -p ‘2 

for some pi,P 2 € S. It follows that the A:-set polytope has at most ( 2 ) edge 
directions. By the following result of parametric linear programming we can 
show that the diameter of Qk{S) is at most ( 2 ): 

Lemma 1. For a polytope Q and two vertices v\, V 2 , there exists a path from 
vi to V 2 which uses each edge direction at most once. 

Proof: Let ci , C 2 G , ci ^ C 2 be the normal vectors of supporting hyperplanes 
at vi, ^ 2 , respectively. Consider the parametric function 

h{0, X) = (0C2 + (1 - 0)ci)^x. 

It is not hard to see that there is a hyperplane with normal vector (0C2-f-(l— 0)ci) 
supporting a vertex or an edge of Qk{S), and that this hyperplane “follows” a 
path from vi to V 2 , while 6 goes from 0 to 1. Simultaneously the expression in 
parenthesis takes no two equal values (since it is a convex hull of ci and C 2 ). 
Therefore each edge on the path from vi to V 2 has a different direction. □ 
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3 Optimization over the fc-set polytope 

3.1 The fe-cells 

Let us identify U :=W^ with a hyperplane in determined by the equation 
Xrf+i = 1. Let be the unit sphere in with center in the origin. Each 
point q in or in U determines an oriented hyperplane h(q) which contains the 
origin o of and has normal vector (^ — o). We denote the sphere which is an 
intersection of with h{q) by hsp{q). The intersection of h{q) with U is denoted 
by hg^ff(q)- Now given a hyperplane h in with normal vector n containing 
the origin (or a corresponding sphere in or a corresponding hyperplane h in 
[/), let I be an oriented line parallel to n which goes through the origin. We 
denote the intersection of I with by Psp(^) ^nd the intersection of I with 
U by Pa,ff{h). The horizon of is the intersection of with the hyperplane 
determined by the equation Xd+i = 0. The introduced mappings give a well- 
known geometric duality between points and hyperplanes. It has the properties 
of incidence and order preservation (see [Ede87]). 

We denote by the spherical arrangement induced by the oriented spheres 
^sp(p)j p ^ S. The cell of containing the “north pole”, i.e. the point 
(0, . . . ,0, 1) is called the base cell 5. It is not hard to see that B is contained 
on the positive side of hsp{q) for each p e S. For a cell c of Asp, we call a ridge r 
(a {d — l)-dimensional face of ^p) of c a horizon ridge, if it is an intersection of 
spheres h\, h 2 of Asp such that /12 bound c, c lies on the positive side of hi 
and simultaneously c lies on the negative side of h^. We say that a hyperplane 
h bounding a cell c is visible (for c) if h has c on its negative side, otherwise h 
is called invisible (for c). 

For A; E {0, . . . , n - 1}, a cell c of ^sp is called a k-cell if the shortest path 
from the relative interior of the base cell to the relative interior of c traverses 
exactly k spheres of Asp- It is not hard to see that the duality stated above gives 
us a one-to-one mapping between the A:-sets of S and the A:-cells of Asp for all 
k e {I, . . . ,n - 1}. We denote by c(T) the fc-cell corresponding to a A:-set T of 
5. 



3.2 Determining polytopal neighbors 

Recall that two vertices of Qk{S) are called polytopal neighbors if they are 
connected by an edge in Qk{S). The corresponding k-sets and the corresponding 
A;-cells are also called polytopal neighbors. The following lemma suggests how to 
determine the polytopal neighbors of a given fc-cell c if the horizon ridges of c 
are given. Using the one-to-one mapping between the A:-sets and the /c-cells, we 
can compute all neighbors of a given vertex of Qk(S). 

Lemma 2. Assume that A: € {2, . . . ,n — 1} and T\, T 2 are k-sets. 

(a) The vertices v(Ti) and v(T2) of Qk{S) are polytopal neighbors if and only 
if the k-cells c{Ti) and c(T 2 ) of Asp share a horizon ridge. 
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(b) Assume that the horizon ridge r is an intersection of spheres hsp{pi), 
hsp{P 2 )y PiyP 2 € S such that c(Ti) is on the positive side of hsp{pi) and 
simultaneously on the negative side of hsp{p 2 )- Then c(T 2 ) is separated from, 
the base cell B by the hyperplanes 

[hsp{p)\p ^ Ti\{p 2 }} U {hspipi)}. 

Proof: Omitted. 

For a fc E {2, . . . ,n — 1}, an arrangement ^sp its A:-cell c, let R{c) be 
the set of horizon ridges of c, and G{R{c)) the incidence graph of R{c) (i.e. the 
vertices of G{R{c)) are horizon ridges, and there is an edge between them if both 
ridges meet). First note that for d > 2 the graph G{R{c)) is connected, since R(c) 
is the set of facets of some {d— l)-dimensional polytope. Thus, if a ridge r € R{c) 
is given, we can enumerate R{c) (starting at r) by tracing the graph G{R{c)). 
The problem to be solved here is to find the neighbors of r in R{c). This can 
be done by finding all hyperplanes bounding r in the central arrangement (we 
consider r itself as an (d— l)-dimensional polytope). As r lies in the intersection 
of two central hyperplanes, we can find all (central) hyperplanes bounding r by 
finding the non-redundant hyperplanes in (3) of 



( 1 ) 


a\x = bi 


(the visible hyperplane) 


( 2 ) 


02X = 62 


(the invisible hyperplane) 


(3) 


Ax < b 


(the remaining halfspaces h'^{p), p £ S) 



The latter problem is the redundancy removal problem (for r), which can be 
solved in time 0(nlp(d, m)), where m is number of non-redundant hyperplanes 
in (3) ([OSS95]). Assume that H is the set of all hyperplanes bounding a horizon 
ridge r of a cell c. Then a horizon ridge r' incident to r in G{R{c)) can be obtained 
by interchanging a visible hyperplane (1) with a visible hyperplane in H, or by 
interchanging the invisible hyperplane (2) with an invisible hyperplane in H. 
This is due to the fact that if d is a cell which shares d with c, then d is 
obviously also a k-ceW, as the numbers of invisible hyperplanes for both c and d 
are equal. 

In this way we can traverse G{R{c)) and find all polytopal neighbors of c. If 
He is the number of horizon ridges of a cell c, then the time required to find all 
neighbors of c is 0(ncnlp(d, n)). Recall that Uc < k{n — k) by Corollary 1(c). 

In , the graph G{R{c)) is disconnected, since the horizon ridges are points. 
We can find all horizon ridges of a cell c by computing all lines bounding c. This 
operation takes time O(nlogn), since it is equivalent to computing the convex 
hull of the points dual to the lines of ^aff • Then we traverse the sequence of these 
lines. A horizon ridge is detected if among two consecutive lines one is visible 
and the other not. 

In E^ , a horizon ridge of a cell c is incident to exactly another horizon ridge 
at each endpoint. This allows a more efficient computation of R{c). First, given 
the combinatorial information of c (its covector), we compute all vertices of c 
in time O(nlogn). For each vertex v we store all three planes containing v. In 
addition, for every two planes hi , /i 2 bounding c we store both vertices of c in 
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hi n h‘ 2 j if they exist. Now given a vertex in a horizon ridge, we check all three 
pairs of planes containing v and determine the two pairs forming the horizon 
ridges. For each of these two horizon ridges we look up in our data structure the 
vertex other than v. If both are already visited, we have determined all horizon 
ridges of c. Otherwise we choose a vertex not yet visited as the next vertex v. In 
this way we need constant time for each horizon ridge (once the vertices of c are 
known). Since the number Uc of horizon ridges of c is 0(n) (c has 0(n) edges), 
the computation of R{c) needs time 0(nlogn + Uc) = O(nlogn). 



3.3 Optimization on Qk{S) 

Let k £ {!,... , n} be fixed. Assume that the points in S are numbered according 
to their lexicographical sorting. For two sets Ti,T 2 G (f) we say that Ti is 
lexicographically smaller than T 2 if and only if the set of indices of the points 
in Ti is lexicographically smaller than the set of indices of the points in T 2 . We 
write Ti </ex ^ 2 - An analogous definition applies to the vertices v(Ti), v(T 2 ) 
of Qk{S) if Ti, T 2 are k-sets. Obviously {pi,... ,Pk} is the lexicographically 
smallest A:-set. We denote the corresponding vertex of Qk{S) by wq- 

In order to find a unique path from a vertex w of Qk{S) to wq, we determine 
first all polytopal neighbors of w as explained in Section 3.2. Then the successor 
of w on the path to wq is the lexicographically smallest vertex of Qk(S) among 
these neighbors. We repeat this step until we have reached wq. 

Note that each path found in this way terminates in wq since the edge graph 
of Qk(S) is connected and since finding a lexicographically smallest set among 
the polytopal neighbors of w is equivalent to minimizing a linear function, which 
is shown as follows. If c is a row vector of length d such that its i-th entry 
equals for z = 1, . . . ,d, then obviously c • v(Ti) < c • v(T 2 ) if and only if 
Ti <iex T ‘2 for Ti,T 2 G (^). The lexicographically smallest polytopal neighbor 
of T{w) is the neighbor with the smallest value c • v(T), so indeed the above 
method optimizes c over Qk(S). 

3.4 Reverse Search using polytopal neighbors 

Assume that k, Qk{S) and wq are the same as in previous section and that W 
is the set of vertices of Qk{S). Let us describe first the local search on Qk{S) 
(cf. [AF96]). Let / : — > W be the function which maps a vertex w G 

W\{it;o} to its successor on the unique path to wq in the way described in the 
previous section. Then we define a graph G with vertex set W whose edges are 
{w, f{w)} for each w G W\{u;o}. By considerations in the previous section G is 
connected. The local search as described in [AF96] is then given by the triple 
(G, {wo},/)- 

The adjacency oracle (A-oracle) Adj (see [AF96]) for the graph G can be 
implemented in the following way. 

(Al) The vertices of G can be represented by the sets T(w) ior w £ W or by 

integer encoding of A:-tuples of indices of the elements in T{w). 
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(A2) The integer S is at most k{n — k) by Corollary 1(c). 

(A3) The adjacency list oracle Adj{w,m) is implemented as follows. We com- 
pute all polytopal neighbors of the vertex w and sort them lexicographically 
by the indices of the points in the corresponding A:-sets. Then Adj{w,m) 
returns the m-th vertex in this sequence. 

The trace of a graph G is the directed subgraph T == (W,E{f)), where 
E(f) = {{w,f{w)) : w £ W\{u;o}}. The height of T is the length of the longest 
path in T. (The height of the trace in G might exceed the diameter of Qk{S), see 
Section 2.2). We can reduce the total time for executing the adjacency list oracle 
if we store the information about adjacent neighbors of a vertex w encountered 
for the first time during the Reverse Search in a data structure which allows 
access in time 0(1). In the same step we compute and store the value of f{w)^ 
which reduces the total time for computing /. This information is kept until all 
vertices of G in the subtree below w are visited. In total, the number of vertices 
for which the neighbors information is stored simultaneously is bounded by the 
height of the trace. 

This precomputation of / and Adj for w (together with sorting of 
the polytopal neighbors of w) requires time 0{nu,nlp{d,n) n^Aogriu,,) = 
0{riw nlp(d,n)), where n^, is the number of polytopal neighbors of w. The total 
time for computing the neighbor information of all vertices is then 

(total number of horizon ridges of all /i:-cells) 0(nlp(d, n)). 

Now, each call of Adj{w,m) takes time t{Adj) = 0(1) and each call of / takes 
time t(f) = 0(1). By the general complexity bound 0(1^1 6{t{Adj) -f- 1(/))) for 
the Reverse Search given in [AF96] [Corollary 2.3] we have 

0((total number of horizon ridges of all A:-cells) nlp(d, n) -h \W\ kn) 

as the total time complexity for Reverse Search using polytopal neighbors. Using 
the fact that the total number of horizon ridges of all A:-cells is at most k (n — 
k) \W\ and that \W\ = o(n^) ([ABFK92]), we obtain 

0(1 W| kn? lp(d, n)) = o{kn?^‘^ lp(d, n)) 

as an upper bound on the above-mentioned time complexity. 

In the time for computation of the neighbors for all vertices of Qk{S) is 
0(|W|nlogn). Since (5 < 2 in this case, the total time complexity of Reverse 
Search using polytopal neighbors is 

0(|W|nlogn). 

By the upper bound \W\ = 0{nk^/^) on the number of k-sets in the plane we 
have 0(|W|nlogn) = logn) for 

In we need time O(nlogn) for computing and sorting of all polytopal 
neighbors of a vertex. Since S = 0(n) in this case, the total time complexity of 
Reverse Search is 

0(|W|nlogn). 

By the bound \W\ — 0{nk?/^) on the number of A:-sets in this time complexity 

is 0{n?k^^^\ogn). 
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4 Reverse Search for fc-sets using combinatorial neighbors 

In the following let G {2, . . . ,n — 1} be fixed, and assume that the points in 

5 are numbered as described in Section 3.3. We say that two fc-sets Ti, in S 
are combinatorial neighbors, if |Ti DT 2 | = k — 1. The corresponding definitions 
apply to vertices of Qk(S) and cells in ^sp- Obviously a set Ti G (^) has at 
most fc(n — fc) combinatorial neighbors. We also say that two distinct sets Ti 
and T 2 in (^) differ by an {i,j)-flip if T 2 = {Ti\{pi}) U {pj} for some integers 
i,j G {1,... ,n}, i ^ j. If two sets Ti, T 2 in (^) are combinatorial neighbors 
then they differ by an (i, j)-fiip. If two fc-sets are polytopal neighbors, then 
by Corollary 1(b) they are also combinatorial neighbors, but the converse is not 
true. This shows the example in Figure 1: we can reach the 4-cell C 2 starting from 
Cl by passing the hyperplane hi visible for ci and the hyperplane invisible 
for Cl. (This corresponds to a (l,2)-fiip). Obviously Ci and C 2 are combinatorial 
neighbors, but since they do not share a horizon ridge, they are not polytopal 
neighbors (Lemma 2(a)). 



h\ /i2 




Fig. 1. The 4-cells ci, C 2 are combinatorial neighbors but not polytopal neighbors 



We can test as follows whether T' G (^) is a fc-set. The following linear 
inequality system is feasible if and only if T' is a fc-set. The system tests whether 
there is a hyperplane c' • x = 6 which separates T' and S\T'\ find c' G and 
6 G M, such that 

c' • p < b for all p G T' , 

c' ' p> b for all p G 5\T'. 

The time for solving the above system is at most lp(d -h l,n). 

4.1 The local search and the adjacency oracle 

Using the definitions of the Section 3.3, we show first how to obtain a unique path 
from any vertex of Qk{S) to the unique optimal vertex wq of Qk{S). Let W be the 
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set of vertices of Qk{S). We denote by / the function / : W\{u;o} — > W, which 
maps a vertex w to its successor in the unique path to wq. The vertex f{w) 
is computed in the following way. For each pair € {1, = .. ,n},i / j, 

let Tij be the set of (^) which differs from T(i(;) by the (i, j)-flip, if Tij exists. 
We denote by Z the family of these sets. Among the sets in Z we determine 
the lexicographically smallest one. The corresponding set is then tested 
for being a fc-set as described in Section 4. If this is the case, v(Tj/^/) is the 
value of f{w). Otherwise we discard Ti>j> and search for the lexicographically 
smallest set among the remaining sets in Z. This procedure is repeated until 
success. The time t{f) for computing f{w) is bounded by 0{kn\^{d l,n)), 
since \Z\ < k{n — k). 

Since each polytopal neighbor is also a combinatorial neighbor, there exists a 
path from a vertex w to wq- Furthermore, the above procedure yields a (unique) 
path ending in wq, since it can be regarded as the optimization on the poly- 
tope Qk{S) by analogous arguments as in Section 3.3. The finite local search is 
then the triple (G, f), where the graph G has W as a set of vertices and 

{{w, f{w)}\w € IF\{u;o}} as the set of edges. 

The time t{Adj) for evaluating the following adjacency oracle is clearly dom- 
inated by the time to test whether T' is a A:-set, and so t{Adj) = lp(d -f l,n). 

(Al) The vertices of G can be represented by the sets T{w) for w £ W or by 
integer encoding of A:-tuples of indices of the elements in T{w). 

(A2) The integer 6 is at most k{n — A:), as each vertex w £ W has at most so 
many combinatorial neighbors. 

(A3) The adjacency list oracle Adj{w,m) is implemented as follows. First, we 
determine the lexicographically m-th pair (i,j) among the pairs {(i,j)\i,j G 
{!,... ,n},i ^ j}. If the set T' £ (^) which differs from by the (i, j)- 

flip exists, then it is tested for being a A:-set. If this is the case, T' (or its 
integer encoding) is returned; otherwise the oracle returns 0. 

The running time of the associated Reverse Search (Adj,6, {wq}, f) is then 

0(1 W| 6{t{Adj) 4- t{f))) = 0(\W\ k‘^n^ lp(d + 1, n)) 

by [AF96] [Corollary 2.3]. 

4.2 Acceleration by storing adjacency 

Similarly as described in Section 3.4, we can compute and store the values of Adj 
and of / for each newly encountered vertex w until all vertices in the subtree of 
G below w are visited. We compute the set of combinatorial neighbors of w in 
the following way. For each pair (z, j), i, j £ {!,... ,n}, i ^ j we test whether 
the set Tij £ (^) which differs from T(w) by a (i, j)-flip exists and is a fc-set. 
The time for this computation is at most k{n-k)\p{d-\-l, n). Simultaneously, the 
value of f{w) is computed. This information is then stored in a data structure 
which gives the value of Adj{w,m) in time 0(1). In the whole Reverse Search, 
the total time for computing the neighbor information for each vertex is then 
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0{\W\kn\i){d + l,n)). Each call of Ad^{w,k) takes time 0(1), and each call of 
/ takes time 0(1). The total time for Reverse Search is then 

0(|kE| kn\p{d + l,n)). 



5 Affine fc-sets 

5.1 Basic properties 

We use the notation of Section 3.1. Given a set of points 5 and the corresponding 
spherical arrangement Asp, let us call as ^afr the arrangement of hyperplanes 
haff{p), p £ S. We can think about ^aff as about the central projection of the 
part of ^sp lying in the ’northern’ hemisphere of onto U. Let us call a cell of 
^sp which is not completely below the horizon an affine k-cell. Obviously there 
is a one-to-one mapping between the cells of Aa,s and the affine A:-cells of ^sp- 
We call a k-set of S an affine k-set if it corresponds to an affine k-ce\\ of ^sp- 
Under the assumption that the 0-cell of Asp contains the point (0, . . . ,0, 1) of 
5^ we can characterize the affine A:-sets in the following way. 

Lemma 3. Let k £ ,n}. Assume that the 0-cell of Asp contains the point 

(0, . . . ,0,1). Then a k-set P C S is an affine k-set if and only if P can be 
separated from, S\P by a hyperplane such that the origin of U is on the sam,e 
side as S\P. 

Proof: Omitted. 

If we allow a translation of the hyperplanes forming Aa,s (or, equivalently, 
a translation of points in 5), then the condition of Lemma 3 can be always 
satisfied: simply translate the hyperplanes forming in such a way that the 
origin of U is located in the 0-cell of ^aff- Thus we can assume in the following, 
that the point (0, . . . ,0, 1) is contained in the 0-cell of ^p. 

We can test whether a set T' £ (^) is an affine k-set in the analogous way 
as described in Section 4: T' is an affine A:-set if we can find c' £ and 6 G M, 
such that 



c' p <b for all p £T' 

d p>b for all p £ S\T' U {o}, 

i.e. if the corresponding inequality system is feasible. The running time for this 
test is lp(d -f- l,n -h 1). 

5.2 Reverse search for affine fe-sets 

The graph of affine k-sets of S has affine A:-sets as nodes, and two nodes share 
an edge if the corresponding affine A:-sets are polytopal neighbors. The Reverse 
Search algorithm can be applied to enumerating affine A:-sets only under the 
condition that the graph of affine A:-sets is connected, i.e. if one can reach each 
affine A:-set from any other affine k-set by moving over affine k-sets. 
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Unfortunately, this condition is not always fulfilled in and it is not known 
to be fulfilled in higher dimensions. In Figure 2 the solid polygon represents the 
4-set polytope Q4(5i) of the points with coordinates (-3.9, -6.8), (—9.1, —4.8), 
(3.1, -0.8), (—11.1, —6.8), (-1, -5), (11, 9), (10, 14). Except for the both (poly- 
topal) neighbors of v\ all vertices of Qa{S\) are affine. Therefore v\ cannot be 
reached from any other affine vertex of Qk{Si) and so the graph of affine 4-sets 
of S\ is disconnected. 




Fig. 2. The graph of the affine 4-sets of S\ is not connected 



We explain in the following how to obtain such counterexamples in E^ (and 
possibly also in higher dimensions). For a A: € {2, ... ,n}, consider the polytope 
Qk{S U {o}) and the set Xk{S U {o}) (where o is the origin). We have 

X,(SU{o}) = X,(5)UX,_i(5), 

since for a T E T contributes to Xk{S) if o 0 T, and T contributes 

to Xk-i{S) if o E T. Therefore, Qk{S U {o}) is a convex hull of the polytopes 
Qk{S) and Qk-i{S). 

Lemma 4. A vertex v of Qk{S) is also a vertex of Qk(S U {o}) if and only if v 
corresponds to an affine k-set. 

Proof: Trivial. 

Now we show that Qk(S) and Qk-i{S) can be translated relatively to each 
other by an arbitrary vector, thus allowing to ’hide’ some vertices of Qk(S) by 
moving them into the interior of Qk-i{S). We translate the points in 5 by a 
vector V E E^ and obtain 5' = {p + v|p E 5}. Each point x\ E Xk{S) becomes 
then xi -h kv, and each point X 2 G Xk-i{S) becomes X 2 4- (A: - l)v. Therefore, 
Qk{S') is the polytope Qk(S) translated by A:v and Qk-i(S^) is the polytope 
Qk-i(S') translated by (k - l)v. Relative to Qk(S), the polytope Qk-i{S) is 
translated by — v. 

By experimenting with some point sets and translation vectors v, we could 
easily find the configuration shown in Figure 2. There, both neighbors of vi are 
inside the dashed polytope Qs{Si), and we conclude that they are the only non- 
affine vertices of Q4(5i). It is also easily possible to enlarge Si and still keep its 
property of being a counterexample. 
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We conclude that the enumeration of affine k-sets in an efficient and output- 
sensitive way remains an open problem. A possible yet not output-sensitive way 
to enumerate the affine A:-sets of S is to enumerate all A:-sets of S and to check 
each output for being an affine k-set by one of the tests described in Section 
5.1. The enumeration of affine fc-sets can be used to solve the problem of finding 
the maximum feasible subsystem of linear relations Max FLS. It includes many 
interesting special cases such as the minimum feedback arc set ([AK95]). 
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Abstract. We study the C-oriented line simplification problem: Given 
a polygonal chain P represented by an ordered set of vertices pi , . . . , pn 
in the plane, a set of orientations and a constant e, we search for a 
“C-oriented” polygonal chain Q consisting of the minimum number of 
line segments that has distance at most e to P in the Frechet metric. A 
polygonal chain is C-oriented if the line segments are parallel to orienta- 
tions in C. We restrict our attention to the version of the problem where 
two circles of radius e formed around adjacent vertices of the polygonal 
chain do not intersect. We solve the C-oriented line simplification prob- 
lem constructively by using dynamic programming together with a nice 
data structure. For usual cases of C our algorithm solves the problem in 
time 0(kn^ log(n)) where k is the minimum number of line segments of 
Q and uses 0(kn^) space. 



1 Introduction 




Fig. 1. Subway lines (dashed) and C- 
oriented simphfied subway lines (|C| = 4). 




Fig. 2. The e-area around each subway line 
is shaded. 



Maps such as those used to describe subway lines, bus plans, cartographic 
schemas for gas, water or electricity mains are often drawn C-oriented. Each line 
segment in these plans is parallel to an orientation in a fixed set of orientations 
(vectors) C in IP? . We assume that for each c G C the orthogonal orientation to c, 
denoted as c, is element of C. This restriction is trivially fulfilled for an even num- 
ber of uniformly distributed orientations. Requiring the maps to be C-oriented 
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helps to make the maps look graphically clearer, more structured, and hence 
easier to read. Consider, for example, the creation of a subway line map given a 
city map. Typically, the number of C-orientations used would be 4: horizontal, 
vertical, and both diagonals. The subway line maintains geographical informa- 
tions like ‘‘north of”, “crosses”, etc. We restrict attention to one central task in 
the automatic generation of C-oriented maps: the C-oriented line simplification. 

Let pi,p 2 ,--*,Pn be n vertices in the plane. Let PjPj-\-i be a (closed) line 
segment from pj to Pj+i- We also call PjPj+i a link. We denote a polygonal chain 
P by P — (pi, . . . ,p„), it consists of line segments pjPj_^i, 1 < j < n. In the C- 
oriented line simplification problem we want to compute a C-oriented polygonal 
chain (i.e. each of the links is C-oriented) with a minimum number of links that 
represents P “well”. Representing P well means that the approximation leads 
along P and remains within e-distance of P, where e is a constant quality bound. 

Generalization and especially line simplification are well studied problems 
in cartography. See [Wei97] for a survey on cartographic generalization require- 
ments and techniques and [AG93,AG96] for a survey on geometric techniques 
that have been used to measure similarity or distance between shapes. Probably 
the most popular (but not C-oriented) line simplification algorithm is the Douglas 
Peucker Algorithm [DP73] and its improvement [HS92]. When the simplification 
line must also be C-oriented, the Douglas Peucker algorithm can not be applied 
directly because the line formed from two consecutive points in the line simplifi- 
cation may not be parallel to an orientation in C; the Douglas Peucker algorithm 
uses only such original data points. Guibas et.al. solved the (non C-oriented) 
line simplification problem under several optimization criteria [GHMS93]. They 
give an 0{n^ logn) time algorithm for finding an approximation that consists of 
a minimum number of links that has Frechet distance at most e to the original 
polygonal chain. In case that the e-circles around adjacent vertices of the original 
polygonal chain do not intersect, they provide a linear time algorithm. Adegeest 
et.al. give a 0(c^n\og^ n) time algorithm for computing minimum c-oriented 
link paths between a pair of points in the plane that avoids a set of n obsta- 
cles [AOS94]. In their definition of c-orientation c is a number greater or equal 
to two defining c unit vectors where each pair of clockwise adjacent vectors has 
the angle For a given starting point A, they iteratively determine the sets 
of all points that can be reached by paths of one link, two links, etc. This is 
the first approach solving the C-oriented line simplification. We use ideas from 
[GHMS93] and [AOS94], but we note that both methods can not be applied 
without major adjustments. 

2 Preliminaries and Problem Definition 

In this section we first introduce some necessary notations and definitions. Then, 
we formalize the C-oriented line simplification problem. 

For two points p and q we denote p^ as the vector [q — p). We distinguish 
between points and vertices of a polygonal chain: we call any q G PjPj+i a point of 
P and each pj a vertex.^ I < j < n. Let p and q be two points on a simple polygon 
P. If p precedes ^ on P we write p ^ q- Let P = (pi, . . .,p„) be a polygonal 
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chain. Call a circle with radius e and center pj the e-circle Cj of pj.The convex 
hull of two 6-circles Cj and Cj^i from two consecutive vertices pj and pj^i is 
called the tube Tj. Fig. 3 shows two tubes Tj, Tj+i, and a point q lying in both 
tubes but not in the e-circle they share. 

We need to distinguish between q G Tj and q G 
Therefore, to a tube Tj including the link 
PjPj^i we assign a level and to a e-circle Cj 
including its center pj we assign levels j and i — 1, 
if j > 1. All operations like fl, G, ^ now refer to 
their operands and their level. As a consequence 
P is simple, Tj HTj^i = and Tj HTj^i = 0, 
for / > 1. Let T = [Jj^lTj. (Fig. 2 shows a set of polygonal chains with the 
area of T shaded gray.) Clearly, each point in T has distance at most e to the 
polygonal chain. We require that the approximation lies in T, starts at a point in 
a start set 5 C (7i, ends at a point in an end set E C Cn^ uses only orientations 
in C, and leads through consecutive tubes. 

Problem 1 (C-oriented line approximation). 

Instance: Let P — (pi, . . .^pn) be a polygonal chain in IFC ^ let C be a set of 
orientations, e a positive constant, such that Cj fl Cj + i = 0, j = 1, . . . , n — 1, 
5 C Cl a connected set of possible start points and C C Cn a set of possible 
end points. 

Problem: Find a C-oriented line approximation of P. That is a C-oriented 
polygonal chain Q— such that qi e S, qm ^ E^ and qi G T, 

i = 1, . . . , m. Furthermore, link qiqPf^ intersects all Cj-|_i, . . . , C/ in this or- 
der, in case qi G Tj^ qi^i ^Ti^ j maximal, I minimal, and j < /. In case that 
qi G Ti^ qi^i G Tj^ j maximal, / minimal, and j < /, link qiqTCl intersects all 
C/, . . . , Cj-i-i in this order. 

Problem 2 (C-oriented line simplification (COLS)). 

Instance: An instance of the C-oriented line approximation problem. 
Problem: Find a C-oriented line approximation of P that consists of a minimum 
number of links. 




3 Reachable Regions 

In this section we introduce the concept of reachable 
regions^ modified from [AOS94]. For any tube Tj and 
any number of links i we will define a set that contains 
all points q G Tj for which a C-oriented link path ex- 
ists that starts in S and leads through Ti, . . . , Tj. For 
these sets we define a recursion formula that can be 
calculated in a dynamic program. 

Let p and q be two points. We say p reaches q w.r.t. c if there exists a A G fR 
such that q = p Ac. Locally in a tube we define the set of points that are 
reachable with one single link within a single tube as follows. 




Fig. 4. Horizontal 
reachable region of S. 
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Definition 1 (Reachable region TZTZ{S^c)). Let T be a tube, S C T, and 

c eC. We denote by TZTZ{S, c) the set of all points q £T for which there exists a 
point p £ S such that p reaches q with respect to c. For c) we simply 

write 1ZTI{S). 

Fig. 4 gives an example of a reachable region. Now, we extend the notion of 
a reachable region to allow multiple links within a sequence of tubes. Note that 
the following definition is consistent with the definition of Problem 1. 

Definition 2 (Reachable region lZlZ\S,Tj)). Let Tj be the tube of a 
polygonal chain P and S the start set. We denote by TZlZ\S,Tj) the set of all 
points q G Tj for which a C-oriented i-link path Q — {qi, . . . , qi) from S to q exists 
such that qi ET, I == 1, . . . , i. Furthermore, link qiqi+i intersects all . . . ,Cd 
in this order, in case that qi G Tj , qi^i G Td, j maximal, d minimal, and j < d. 
In case that qi G Td, qi+i G Tj, j maximal, d minimal, and j < d, link qiqi^i 
intersects all Cd^ • • • , Cj+i in this order. 

Note that 7^7^°(5,Tl) = 5, = 0 for j > 1 and 7^7^‘(5,Tl) = 

THl{S) in respect to tube T\, Observe that COLS always admits a solution. 

Property 1 (Existence), Every polygonal chain P has a C-oriented line simplifi- 
cation for arbitrary e > 0. 

We now introduce the concept of extensions. This allows us to extend a 
reachable region r C lZU'{S,Tk) into ll7l'{S,Tj) whenever possible, k / j. 
That is, we compute the c-oriented reachable region r of a c-oriented reachable 
region r', c ^ c, and try to extend it into as many consecutive tubes as possible. 
Fig. 5 shows an extension of a c-oriented reachable region from Tj-\ to Tj. Note 
that the extension in Fig. 6 is bounded by a boundary segment of Cj. A point q 
in the striped black region of Fig. 6 does not belong to the extension of r' from 
Tj-i to Tj, since a link from r' into that region does not cut Cj. 

We define a forward extension as an extension from a tube Tk into a tube Tj 
with k < j. Similarly, a backward extension is an extension from a tube Tj into 
a tube Tk with j > k. 



Fig. 5. The c-oriented reachable region r* Fig. 6. The extension of P is bounded by 
is extended (shaded dark). a segment of Cj in Tj (shaded dark). 

Definition 3 (Extension). 

Forward Extension TS{U,Tk,Tj): Let U CTk for some k < j. We denote by 
FS{U,Tk,Tj) the set of all points q G Tj for which a C-oriented link from 
a point p G U\Ck+i exists, such that pq intersects all Ck+i^ .. ,,Cj in this 
order. With FS{U,Tk,Tj,c) we denote the subset of FS{U,Tk,Tj) where all 
links are c-oriented, c G C. Then, FS{U,Tk,Tj) = FS{U,Tk,Tj,c). 
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Backward Extension BS{U^Tk^Tj): Let U C Tj for some k < j. We denote 
by BS{U, Tk,Tj) the set of all points q G Tk\Ck+i for which a C-oriented link 
from a point p e U exists, such that pq intersects all Cj,...,Ck+i in this 
order. 

We will show in the next lemma that we do not need backward extensions to 
reach all reachable points with a minimum number of links. This does not follow 
directly from the problem definition. We have to make use of the assumption 
that for each orientation c in C also the orthogonal orientation c is element of C. 

Lemma 1 (Backward Extensions). Let q G IZTl^ {S,Tj). There exists a C- 

oriented path Q = {qi, ^q^ = q) from qi e S to q such that each link qkqk+i is 

either contained in a tube Ti for some I < j , or qkqk-\-i intersects a// (7e+i, • • • , C/ 
in this order with qk G Te, qk+i ^ Tj , e maximal, f minimal, and e < f. 

Proof: Assume there exists a ^ G 1ZlZ'{S,Tj) that can only be reached with i 

links, when backward extensions are also considered. Let Q — {qi, ^q^ — q) 

be a path from q\ £ S to q. Then, there exists a minimum k, a maximum e, 
and a minimum / with e < f,qk G Te, qk+i G Tj such that qkqk+i intersects 
C/,...,Ce+i in this order. We show how this link, its predecessor link, and 



Fig. 7. The solid polygonal subchain can 
be replaced by the dashed one. 

successor link can be replaced by 3 other C-oriented links that are not backward 
extensions. Observe that Q has no self intersections, when we assign to a tube 
Ti a level i and to a circle Ci level i - 1 and level i, as introduced in Section 2. 

Note that Q enters the circle Ce+i three times. Let the intersecting points of 
Q n dCe+i he u ^ V ^ u' v' ^ u" ^ v" (see Fig. 7, 8). Since u can reach 
within Ce+i in at most 2 links we can assume that qk-i lies in a tube T^ w ith 
x < e and qk ^2 lies in a tube Ty with y > e. Therefore, uv, u'v', and u"v " are 
segments of Q and therefore C-oriented. Let uv have orientation c and let u^'v" 
have orientation c. 

We make a case distinction according to the location of u, v, u" and v": 
There exists a line am that cuts Pe+i? uv and u"v": In this case we claim 
that there exists di p £uv such that p -f Ac has non empty intersection with 





Fig. 8. The sohd polygonal subchain can 
be replaced by the dashed one. 




18 



1 /"^;". Let R—{x^ Ce+i|3y G uv^ A G iR such that x = y+Xc}. Observe that 
R is point symmetric to the midpoint Pe+i- Since u and v are positioned on 
opposite sides of am in Ce-|-i, it follows that each point on am can be reached 
with an c-oriented link starting at a point on uv. Since u"v" cuts am it 
follows that there exists a point z on m;, a point z' on and a A G /R 

with z' = z + Ac. Thus, replacing Q hy Q' - {qi, . . . , qk-i, z, z', g/e+ 2 , • • • , qi) 
yields that has as many links as Q and (q^i, . . . , qk-i’, z', qk-\- 2 ) has no 
backward extension. 

The points u^v^u" and v" lie on one half circle H of dCe+ii We make a 
case distinction according to the possible orders of u, i;, u", and v" on H: 
Order u", u, v'' and u, u", v and their reversals: See Fig. 8 for an exam- 
ple. Then, either there exists a A G iR and a z on such that z = u + Ac 
or there exists a A G ^ and a z on uF such that z = u" + Ac. In the first 
case, Q' — (^i, . . . , u', z, g'/c-|- 2 , . . . , g'j) is a link path with at most as 

many links as Q has and (^i, . . ., qk-u 2 :, qk-\- 2 ) has no backward exten- 
sion. In the second case Q' = (^ 1 , . . . , z, u"^qk-\- 2 ^ • • is a link path 
with at most as many links as Q has and Q' = (g^i, . . . , z, u", qk+ 2 ) has 
no backward extension. 

Order order and their reversals: In this case either 

link q^qk lies inside Tq or qk-\-iqk +2 lies inside Tg. In both cases qk+i can 
be reached without backward extensions. 

This procedure can be applied to all backward extensions in Q. Thus, all points 
in 7Z7V {S^Tj) are reachable without backward extensions. □ 

We now can express the calculation rule for the reachable regions as a recur- 
sion formula. 

Theorem 1 (Recursion Formula). For i^j > 0 we have 

j-i 

mi\s,Tj) = nn(Tm'-\s,Tj)) u \J f£(mi'-\s,Tk),Tk,Tj) 

k = l 

With RlV {S^Tj^c) we denote the subset oflZ1Z^{S^Tj) where the last link is 
c-oriented. Then, RlV {S,Tj) — Ucgc follows. 

Corollary 1. Let i be minimal such that E HlZlZ^S^Tn-i) / 0. The minimum 
C-oriented link path from S to E has i links. 

The sets lZlZ\S,Tj) can be computed by dynamic programming according 
to the Recursion Formula. Let q G TZTZ^ {S,Tj ,c). The C-oriented link path from 
q to S can be computed as follows: Starting at q we create a line ^ -h Ac, A G iR. 
The c-oriented reachable region of q is associated with its generating reachable 
region r' G TZlZ^~^{S,Tk,c). We compute a point q' G r' which lies on -f Ac. 
This point q' is the next vertex of the link path. Then, we repeat this procedure 
starting at q' and continue until we reach a point in S. 

Inserting each reachable region into a set lZlZ\S,Tj) yields that each re- 
gion r G lZ7Z^{S,Tj) generates at most |C| reachable regions in TZlV'^^{S,Tj). 
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Furthermore, each region r G 72.7?.* (5, Tj) can generate at most \C\ reachable 
regions in the extension step per tube / = j + 1, . . .n. Let /(i, i) be the number 
of reachable regions in 72.72.* (5, Tj), 1 < j < n. The following equations hold: 
/(^i) ^ \^\Ya=i fi^ ~ if ^ 7 > 1 /(l,i) = \C\. These recurrence 

equations lead to a number of reachable regions that grows exponentially in the 
number of line segments k of the minimum link path. 

4 Speed-Up Techniques 

In order to consider only a polynomial (in and |C|) number of reachable 
regions we study the form of reachable regions and classify them. This allows us 
to handle and store the reachable regions efficiently. 

Let r be a c-oriented reachable region in a tube Tj. Then, the boundary of r 
has the nice property that is determined by the boundary of Tj , the boundary of 
Cj and c-oriented lines. In case that Cj nCj^i ^ 0 the boundary of a reachable 
region can also contain boundary segments of non local circles, which would 
make the handling much more difficult. 

Theorem 2 (Type of a Reachable Region). Let S C C\ be a connected 

set of points in C\, Let c £ C. A c-oriented reachable region r in Tj can be 

categorized to have the following forms: 

1. r is determined by the intersection of two parallel c-oriented lines and the 
boundary ofTj. We say that type{r) = 1. (See Fig. 5.) 

2. r is a reachable region in Tj that is defined by a connected boundary segment 

Sj G dCj\dTj-i and a direction r G {c, — c}. r = {q ^ Tj \ 3q' E Sj ^ \ > 0 

such that q =: q' Ar}. We say that type{r) — 2. (See Fig. 6.) 

Due to space limitations, we provide only proof sketches. Note, all proofs are 
given in the technical report of this paper [Ney98]. 

Sketch of the proof: Let r be a reachable region in 7^7^* (5, Tk). For each case 
we show by induction on i [j respectively), that (1) lVR,{r^c) has form 1, (2) 
F£{r.,Tk^Tj,c) has form 1 in case that Ar < j — 1, and (3) (r, T/e, c) 

yields at most 2 reachable regions of form 2. □ 

We now show how to shrink the number of reachable regions by deleting 
redundant reachable regions and unifying reachable regions. 

Property 2. Let r be a c-oriented reachable region in 7^7^* (5, 7} , c). Then, r can 
be deleted in case that there exists a r' G 7^72.*(5, Tj, c) with r C r'. 

Property 3. Let r and r' be two c-oriented reachable regions in 72.7^* (5, Tj, c) 
with r n r' 0, r ^ r', and r' ^ r. Let r and r' be reachable regions of type 
1 or of type 2 with equal r. We can unify r and r' for the computation of the 
reachable region of r and r'. 

Note that we must store a back pointer from r U r' to r and r' since during the 
computation of the link path we have to follow the links backwards until we 
reach the start set S. 

Let c be the orthogonal orientation to c. The projection of a tube Tj to c 
yields an interval /(Tj,c). 
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Property Each c-oriented reachable region in Tj is uniquely represented by an 
interval in I{Tj , c) (namely the projection of the reachable region to c) and a 
tag specifying whether the reachable region is of type 1 or type 2 accompanied 
by the orientation r. 

In a further tag field a reference to the generating reachable region is stored. 
In order to store the reachable regions efficiently, we associate three sets of inter- 
vals with each IZlV {S^Tj^c): Jf(5, Tj,c, ±) contains all intervals corresponding 
to reachable regions of type 1. Tj^c^ -h) contains all intervals corresponding 
to reachable regions of type 2 with r = -f c and Tj^c^—) with r — —c. 

We insert a c-oriented reachable region r into 1Z7Z\S^ 7}, c) only if it is not yet 
contained in 7^7^* (5, 7}, c). In this case we delete all c-oriented reachable regions 
r' that are contained in r (see Property 2). Before we compute 
we unify the reachable regions in IZlV Tj^ c) for each c G C in order to shrink 
their number (see Property 3). Let x G — }. The insertion operations in 

Xi{S^ Tj^ c, x) have the property that each two intervals i\ and 2*2 in Xj (5, Tj^ c, x) 
are either disjoint or ^ 22 and i\ ^ Z 2 . We suggest handling each X* (5, Tj^c^ x) 
as an ordered list. Thus, the insertion, deletion, and search for an interval can 
be done in 0(log/) time, when / is the length of the list. Let u be an interval 
that contains k intervals of the list. Since these k intervals are consecutive in the 
list, the deletion or search of these intervals can be done in 0(log/ -f k) time. 

If we once have reached a point p E: Tj we can estimate the maximum number 
of iterations that we have to compute extensions into that tube. 

Property 5. Let p G 1Z1V{S^ Tj). Then, Cj C 7^7^*'^^(5, 7}, c), for each c G C. 

This ends our speed-up investigations. Recapitulating, we compute the reach- 
able regions 1Z1V{S^ Tj) by using dynamic programming, together with the inter- 
val lists as data structure, the insertion and update technique, and the sensitive 
computation of reachable regions. That is, we compute a c-oriented reachable 
region in a tube only in case when the tube is not yet covered with c-oriented 
reachable regions. Furthermore, we compute a forward extension into a tube Tj 
only in case when the circle Cj is not yet covered with reachable regions. 

5 Runtime and Space Calculation 

We now calculate the maximum number of reachable regions in each tube. Ac- 
cording to our Recursion Formula 1 we will estimate the number of c-oriented 
reachable regions in a tube T that is generated by 

1. non c-oriented reachable regions in T (see Lemma 2). 

2. forward extensions of non c-oriented reachable regions into T (see Corol- 
lary 2). 

3. c-oriented reachable regions in T (see Property 6). 

4. forward extensions of c-oriented reachable regions into T (see Lemma 3). 

With these calculations we are able to estimate the maximum number of reach- 
able regions in 7^7^* (5, 7}, c) (see Theorem 3) which leads to the runtime and 
space of the algorithm. 
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We call two reachable regions ri, T 2 of one orientation in a tube disjoint, if 
n n f 2 = 0, type(ri) /type(r 2 ), or type(ri) =type(r 2 ) = 2 and r(ri) ^ r(r 2 ). 
Since we unify overlapping reachable regions in TZTV'~^{S, Tj,c) before computing 
the sets TZTZ\S,Tj ,c), we only have to take into account the number of disjoint 
reachable regions in TZTZ\S,Tj,c). 

Lemma 2. Let be the c-oriented reachable regions in IZIZ^ {S,Tj ,c) . 

Let the angle between c and c be a. ri,...,r/e create at most disjoint c- 

oriented reachable regions in {S,Tj , c) . 

Sketch of the proof; Restricting our attention to the number of disjoint reach- 
able regions of r[ = riClCj, . . . ,r'f^ = rkCiCj gives an upper bound. Furthermore, 
we can assume that r'^, . . . , are ordered according to c, have minimum diame- 
ter, minimum distance to each other and r[ has distance e to pj. Trigonometric 
calculations then yields the minimum distance from to r[ (r^ to r^, . . . ), such 
that their c-oriented reachable regions do not intersect. □ 

Corollary 2. A maximum of . ](|C| - 1) disjoint c-oriented reachable re- 
gions can exist in 

u[Jiz\ TS{{nn\S,Tk)\nn\S,Tk,c)),n^ 

Property 6. 1. Each 7^7^* (5, Ti, c) contains one unique reachable region, c G C. 

2. Let r G 7ZlZ\S,Tj,c) with typ.e(r) = 1. Then, lZlZ{r,c) = r. 

3. Let r G IZIZ^ {S,Tj ,c) with type(r) = 2 and r G {-fc, — c}. Then, 7l1Z{r,c) D 
r and type{7ZlZ{r, c)) = 1. 

In case that a c-oriented reachable region has a c-oriented forward extension 
from tube Tk to tube Tj, then at most one c-oriented reachable region in each 
tube T/c_j_i, . . . , Tj_i has a forward extension into Tji 

Lemma 3. Let {S,Tk,c),Tk,Tj,c) ^ k < j < n. Then, TS[VJV [S, 

Ti,c),Ti,Tj,c) contains at most one reachable region for any I , k < I < j . 

Theorem 3. 7^7^* (5, Tj , c) contains at most ‘^j\j^—^[\C\ — l) disjoint reachable 
regions. 

Sketch of the proof: The number of reachable regions in IZIZ^ (S,Tj , c) 
is the sum of the reachable regions in: (1) 7^7^(7^7^*(5, 7j)\7^7^*(5, , c), c), 

(2) Ui;l•^^((^^'(‘5',T,)\7^7^'(5,T,,c) (3) 7^7^(7^7^'(5,T,,c),c), and (4) 

(j{z\ PS{UW{S,Tk,c),Tk,Tj,c). By induction on j we show that the sum of all 
reachable regions is bounded by ^j\ ~ !)• ^ 

We measure the distance between two points p — (xi,yi) and q = (^ 2 , ^ 2 ) in LR^ 
in the L 2 metric. That is, ||p — ^^112 •= ({^i ~ ^ 2 )^ A {vi ~ y 2 )^)^ • The estimation 
of the number of reachable regions leads to the running time and space needed 
by the algorithm. 
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Theorem 4. Let imax == maxj- 2 ,...,n{ 2 e running time of 

the algorithm is bounded by 0{n‘^\C\‘^Lmei^\ 2 J . 1 lQg(^IQI|[ 2 a^ • 1))^ needs at 
most 0(n^|CpXmaxr 2 ~ a~ 1) Expressed in the minimum number of links k 

of COLS the running time is bounded by 0{kn^\C\^\ . ] log(n|C| [ —- ■■-])) and 
the space by 0{kn‘^\C\^\ 2 J .^ 1)» 

6 Relation to the Frechet Metric 

We note relationships to the Frechet metric for curve similarity and line sim- 
plification [AG93,God91,God98]. Intuitively, two curves a and (3 have Frechet 
distance at most e, if a person walking along a can walk a dog along /S with 
a leash of length e. More precisely, two curves have distance at most e under 
Frechet metric if and only if they have monotone parameterizations a and (3 
(a,/? : [0,1] -> /R^), such that \\a{t) — (3{t)\\2 < e for all t G [0,1]. Fig. 2 
shows for each polygonal chain a minimum C-oriented line simplification that 
has Frechet distance at most e. 

In this section we show that a minimum C-oriented line simplification with 
Frechet distance at most c to P has as many links as a minimum C-oriented line 
simplification has. Furthermore, we show how to obtain a minimum C-oriented 
line simplification with Frechet distance at most e to P from the reachable re- 
gions. In the next definition we define the neighborhood of a point q as the 
connected component of the intersection of P with the e-circle around q. 

Definition 4 (Neighborhood N{q)). Let P — (pi,...,pn) be a polygonal 
chain and Q = (g'l, . . .,^m) be a C-oriented line approximation of P . Let q be a 
point on Q, q £Tj, j minimal, and I < j < n — 1. 
q G Tj\Cj^i: Let N{q) = {x G PjPj-\-i such that \\q — x \\2 < e}. 
qeCj^i: Let N{q) = {x G PjPj-\-i ^Pj+iPj +2 such that ||^ — a :||2 < c}. 



Let Q be a curve with Frechet distance at most e to P. From this definition 
follows that P and Q have monotone parameterizations a and f3 such that a{t) 
is in the neighborhood of ^{t). 

Property 7. 1. N{q) is a continuous subset of PjPj+i or (pj, pj+i, pj_^ 2 ), resp. 

2. Let Q have Frechet distance at most e to P. Let a and (3 be monotone 
parameterizations of P and Q,such that ||a(/) — /3(t)\\2 < e for all t G [0, 1]. 
If ^{t) — q then a{t) G N{q), t G [0, 1]. 

A C-oriented line approximation Q of P has Frechet distance at most e to P 
if and only if the neighborhood of Q contains P and for any two points q' and 
q" on Q where q' precedes q" on Q the neighborhood of q'' does not precede the 
neighborhood of q" on P: 

Lemma 4. Let P = (pi, . . . , Pn) be a polygonal chain and Q = (^ 1 , . . . , be 
a C-oriented line approximation of P. Q has Frechet distance at most e to P <=> 

(1) for all points p on P there exists a q on Q such that p G ^rid 

(2) for all points cf ^ cf' on Q we have that N{q'^) / N{q'). 
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Any C-oriented polygonal chain with Frechet distance at most c to P is a 
C-oriented approximation of P: 

Theorem 5. Let P — (pi, . . .,Pn) be a polygonal chain. Let Q = (g^i, . . .qm) be 
a polygonal chain with qi E S, Qn E. E and qiqj^ G C. Furthermore, let Q have 
Frechet distance at most e to P. Then, Q is a C-oriented approximation of P . 

After these preparations the main theorem of this section can be proven 
similarly to the proof of Lemma 1. 

Theorem 6. Let P = (pi, . . . , p„) be a polygonal chain and Q — {qi, . . . , qm) be 
a C-oriented line simplification of P . Then, there exists a C-oriented line simplifi- 
cation Q' luith Frechet distance at most e to P that has as many links as Q has. 

Remember that we construct the C-oriented line simplification by following 
the links backwards until we reach the start set S. Following a link backwards 
until it hits a point in its generating reachable region for the first time yields an 
approximation with Frechet distance at most e to P. This is true since this way 
of processing disables the transformation described in the proof of Theorem 6. 

7 Conclusion 

We presented an algorithm that solves the C-oriented line simplification problem 
with the additional restriction that the C-oriented polygonal chain has Frechet 
distance at most e to the original chain in polynomial time. 

Theorem 7. The C-oriented line simplification problem (COLS) with the ad- 
ditional restriction that the C-oriented polygonal chain has Frechet distance at 
most e to the original chain can be solved in 0{kn^ [ 2 a • " 1 lQg(^|^| . 1)) 

time. The algorithm needs 0{kn^\C\^\ space (k is the minimum number 
of links, k < 4n 

For arbitrary sets of orientations we have a negative result: 

Corollary 3. In case that C does not contain the orthogonal orientation to each 
orientation, the minimum C-oriented line simplification can have fewer links than 
the minimum C-oriented approximation that has Frechet distance at most e to P. 

Proof: Fig. 9 shows a polygonal chain and its minimum C-oriented line simpli- 
fication that has Frechet distance more than e to P since ^ on P, 

with q' -< q" . Each C-oriented approximation that has Frechet distance at most 
6 to P consists of at least 4 links. □ 

In case that C does 
not contain the orthogonal 
orientation to each orien- 
tation our algorithm still 
computes a C-oriented ap- 
proximation but the re- 
sulting approximation might Fig. 9. Example for Corollary 3. 
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neither be minimal, nor have Frechet distance e. In order to gain minimality it 
is necessary to allow backward extensions from a tube Tj to Tk with k < j. An 
open question is how to compute the minimum C-oriented approximation that 
has Frechet distance at most e to P in this case. 

The restriction that the e-circles of two consecutive vertices of the original 
polygonal chain do not intersect is necessary when we want to solve this problem 
by computing the sets TZTZ^ (S^Tj) in polynomial time. There exist examples 
where the number of disjoint reachable regions in TZlV[S^Tj) gets exponential 
in j when allowing (7/ fl C/ 4.1 ^ 0. Since the profit of deciphering such an example 
is rather small we decided not to expose them. The complexity of the C-oriented 
line simplification problem when allowing Cj C\ Cj^i ^ 9 remains open. 
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Abstract. Given a graph G with weighted edges, and a subset of nodes 
T, the T-join problem asks for a minimum weight edge set A such that a 
node u is incident to an odd number of edges of A iff 7/ € T. We describe 
the applications of the T-join problem in sparse graphs to the phase 
assignment problem in VLSI mask layout and to conformal refinement 
of finite element meshes. We suggest a practical algorithm for the T- 
join problem. In sparse graphs, this algorithm is faster than previously 
known methods. Computational experience with industrial VLSI layout 
benchmarks shows the advantages of the new algorithm. 

1 Introduction 

Given a graph G with weighed edges, and a subset of nodes T, the T-join Prohlew, 
seeks a minimum weight edge set A such that a node u is incident to an odd 
number of edges of A iff ly. C T. One can find a discussion of the T-join problem 
in Cook et al. [4], pp. 166-181. 

In this work, we develop a new exact algorithm for the T-join problem which 
is motivated by the applications in VLSI mask layout. Section 2 describes the 
context of the phase assignment problem in VLSI phase-shifting masks. The 
corresponding graphs are sparse, with a large number (up to millions) of nodes. 
Similar graphs appear in conformal refinement of finite element meshes. 

A traditional reduction of the T-join problem to minimum weight perfect 
matching is too time- and memory-consuming to be practical. In Section 3 we 
suggest a new reduction to the perfect matching problem which increases the 
size of the graph by at most a factor of two. This reduction is linear and does not 
contain any large hidden constants. The achieved runtime is 0((n log n)^/^)a(n), 
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was partially supported by NSF Grant CCR-9700053 and A. Zelikovsky was partially 
supported by GSU Research Initiation Grant ^^00-013. 
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where a is the inverse Ackerman function and n is the number of nodes in G. In 
the concluding Section 4 we describe our computational experience with layouts 
derived from standard-cell VLSI designs obtained from industry. 

2 Phase Assignment in VLSI Phase Shifting Masks 

In the manufacture of a given VLSI circuit layout, photosensitive material is 
exposed to light that is passed through a m,ask. Without loss of generality, clear 
regions in the mask correspond to desired shapes, or features, in the layout. 
Phase-shifting mask (PSM) technology, proposed by Levenson et al. [11] in 1982, 
enables the clear regions of a mask to transmit light with prescribed phase shift. 
Given two adjacent clear regions with small separation, and respective phase 
shifts of 0 and 180 degrees, the light diffracted into the nominally dark region 
between the clear regions will interfere destructively] this gives improved image 
contrast (i.e., between light and dark) and better resolution of the two features. 
PSM is enabling to the subwavelength optical lithography upon which the next 
several VLSI process generations depend [17]. 

Two positive constants b < B define a relationship between manufacturability 
of the layout and the distance between any two given clear regions [15]. The 
distance between two features cannot be smaller than b without violating the 
minimum spacing design rule. If the distance between two features is at least b 
but smaller than B, the features are in phase conflict,^ which can be resolved by 
assigning opposite phases to the conflicting features. In other words, B defines 
the minimum spacing when two features have the same phase, while b defines 
the minimum spacing when the features have opposite phases. If the distance 
between two features is greater than B, there is no phase conflict and any phase 
assignment is allowed. 

The Phase Assignment Problem: Given a layout, assign phases to all 

features such that no two conflicting features are assigned the same phase. 

Given a layout, consider the conflict graph G =< V, E > which has a vertex 
for each feature, and an edge between two vertices iff the corresponding fea- 
tures are in phase conflict. Observe that the Phase Assignment Problem can be 
solved iff the conflict graph is bipartite, i.e., has no odd cycles. The only way 
to change the conflict graph is to perturb the layout, e.g., perturb the locations 
of features such that they are no longer in phase conflict. Thus, if the conflict 
graph is not bipartite, we seek a minimal perturbation of the layout such that 
the conflict graph in the new layout is bipartite. The following method for layout 
modification and phase assignment was proposed in [10], extending work of [15]. 

(i) given a layout, find the conflict graph G; 

(ii) find a (minimum) set of edges whose deletion makes the conflict graph G 
2-color able; 

^ More precisely, two features are in phase conflict if (i) there is no pair of points, 
one from each feature, whose separation is less than 6; and (ii) there is some pair of 
points, one from each feature, whose separation is less than B. 
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(iii) assign phases such that only the conflict edges in this (minimum) set connect 
features of the same phase; and 

(iv) com, pact the layout with “PSM design rules”, i.e., apply a layout compaction 
tool that enforces separation at least B between features that are assigned 
the same phase, and separation at least b between features that are assigned 
different phases. 

In this approach, the key step is determining which set of edges in the conflict 
graph correspond to a “minimum perturbation” of the layout. 

The Minimum Perturbation Problem: Given a planar graph G =< V, E > 
with weighted (multiple) edges. And the minimum- weight edge set M such that 
the graph < V, E — M > contains no odd cycles. 

The Minimum Perturbation Problem can be reduced to the T-join problem 
in the following way. We use the following definitions. A geom,etric dual of an 
embedded planar graph G =<V,E > is a, multigraph D =< F,E > in which 
nodes are the faces of G. If f,g are two faces of G, i.e. two nodes of D, than an 
edge of G connects / with g if it belongs to both of them. A reduced dual of G is 
a graph D —< F,E > obtained from D by deleting all but one of the edges that 
connect a given pair of nodes. The undeleted edge must be the one of minimal 
weight. 



Lemma 1 The Minim, um, Perturbation Problem, for a planar graph G is equiv- 
alent to the T-join problem, in the reduced dual graph of G. 

Proof. To eliminate all odd cycles it is sufficient to eliminate odd faces of the 
planar graph G (see Figure 1). The odd faces of G form odd-degree vertices of D. 
Any edge elimination in G corresponds to edge contraction in D. In particular, 
if we eliminate a set of edges A in G, then the resulting nodes of (modified) D 
will correspond to connected components of < F, A >. Given such a component 
with sum of node degrees d and k edges, the corresponding node has degree 
d — 2k. Thus A is a feasible solution iff each connected component of < F, A > 
contains an even number of odd nodes (odd faces of G). Moreover, for each 
feasible solution A C E there exists a feasible solution A C E with weight that 
is not larger; we obtain A from A by replacing multiple edges connecting a pair 
of nodes/faces / and g with a single edge of minimum weight. 

If we define T to be the set of odd faces of G, then finding the minimum 
cost feasible solution is the same as solving the T-join Problem for F. □ 

After the Minimum Perturbation Problem is solved, i.e., the set of edges M 
is determined and deleted, the valid assignment of phases can be found using 
breadth-first search. For each connected component of the conflict graph (the 
weight of each edge is set to 1), starting from arbitrary vertex v breadth-first 
search determines the distance from v to each other vertex u. If the distance from 
to is even, then u is assigned the same phase as v] otherwise, u is assigned 
the opposite phase. Such breadth-first search can be performed in linear time. 
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Fig. 1. Prom the conflicts between features (a), the conflict graph is derived (b). The 
dual graph (c) is constructed. The vertices of odd degree are matched using paths in 
the dual graph (d), and the corresponding conflict edges are determined (e). Finally, 
the minimum set of conflicts to be deleted is determined (f). 



Quadrangulations for Finite Element Meshes 

Another application of the sparse T-join Problem is described in [14]. Confor- 
mal mesh refinement has gained much attention as a necessary preprocessing 
step for the finite element method in the computer-aided design of machines, 
vehicles, and many other technical devices. For many applications, such as tor- 
sion problems and crash simulations, it is important to have mesh refinements 
into quadrilaterals. The problem of constructing a minimum-cardinality confor- 
mal mesh refinement into quadrilaterals is well known. This problem is NP-hard 
and for meshes without so-called folding edges a 1.867-approximation algorithm 
is suggested in [14]. This algorithm requires 0(nm log n) time, where n is the 
number of polygons and m the number of edges in the mesh. The asymptotic 
complexity of the latter algorithm is dominated by solving a T-join problem in 
a certain variant of the dual graph of the mesh. Although the T-join problem 
can be solved fast for planar graphs by an application of the planar separator 
theorem (see [12] and [2]), our reduction is much simpler and does not contain 
any large hidden constants. 

3 A Fast Algorithm for the T-join Problem 

The T-join problem was solved by Hadlock [8] and Orlova &: Dorfman [16] using 
the following reduction. 

Lemma 2 The T-join problem for a graph with n nodes can be reduced to 
Minmum- Weight Perfect Matching problem, in a complete graph with \T\ nodes. 
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Proof. Every minimal T-join is the union of edge sets of edge disjoint paths 
that, viewed as edges connecting their endpoints, provide a perfect matching 
of set T (see [4], p. 168). Thus every minimal T-join corresponds to a perfect 
matching, with the same cost, in a complete graph with node set T and edge 
weights defined as the shortest path lengths in the original graph. Conversely, 
every perfect matching in the new graph yields a T-join considering the paths 
that correspond to its edges, and taking the edges of the original graph that 
belong to an odd number of these paths, obviously the cost of this T-join is not 
larger than the cost of the matching. Consequently, the minimum cost perfect 
matching must correspond to a minimum cost T-join. □ 

The reduction defined in Lemma 2 has two drawbacks. First, the reduction 
itself can be slow, because finding all pairwise distances between vertices of 
T is too time- and memory-consuming. Additionally, the resulting instance of 
Minimum-Weight Perfect Matching Problem may have many more edges than 
necessary, and thus itself is too difficult to be used in practice. The present work 
provides an approach that is much more efficient in the case of sparse graphs 
(note that planar graphs are always sparse, because the number of edges is less 
than six times larger than the number of nodes). 

In this section we present a faster reduction of the T-join problem to the 
minimum weight perfect matching problem, which yields a faster exact algorithm 
for the T-join Problem in sparse graphs. 

3.1 Opportunistic Reductions 

In this subsection we describe simplifying, “opportunistic” reductions that serve 
to normalize input graphs for the reduction from the T-join problem to perfect 
matching that is described later. These reductions do not improve the worst case 
performance of algorithms for the T-join problem, but nevertheless help in many 
real-life instances. 

The first opportunistic reduction reduces the T-join problem to instances 
with biconnected graphs. 

Theorem 1 Consider an instance of the T -join problem, described by the graph 
< V,E >, edge weight function w and T C V. Assuw.e that < V,E > has 
biconnected com,ponents < Vi,Ei >,...,< Vk,Ek >. Then in linear tim,e we 
can find sets Ti C such that A C E is an optim,al T -join if and only if for 
z = I, . . . , A:, An E{ is an optim,al Ti-join for <Vi^Ei > and w\Ei - 

Proof. If a biconnected component < Vi^Ei > happens to be a connected 
component, then for obvious reasons it suffices to define Ti — T C\Vi. Similarly, 
the claim is trivial if < V, T > is biconnected. Now consider < Vi,Ti >, the 
first biconnected component reported by Hopcroft’s algorithm (see [I], pp. 180- 
187); it is a property of this algorithm that this component contains exactly one 
articulation point, say v. Let Eq — E — E\, and Vq — V — ViC\ {i;}. We will find 
sets Ti and Tq such that A is a T-join for < y, T > if and only if Tj fl Ej is a 
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solution for <Vj,Ej > for j = 0, 1. We have four cases. In the first two, v £T. 
If |Tn Vi| is even, v must be incident to an odd number of edges from Ei, and 
thus to an even number of edges from Eq. Thus we can set T\ — T C\V\ and 
To = T — Vi . If |T n Vi I is odd, then v must be incident to an even number of 
edges u\ A C\ El and thus to an odd number of edges from A D To, consequently 
we can set Tq — T C\Vq and T\ = T — Vb- In the remaining two cases, v ^T.\i 
|T n Vi I is even, v must be adjacent to an even number of edges from AC\Ei 
and an even number from A D To, consequently we can define Tj —T C\ V} for 
j = 0, 1 . If T n Vi is odd, V must be incident to an odd number of edges in both 
A n To and A fl Ti , so we can define Tj =T C\ Vj U {i;} for j = 0, 1. 

In this fashion, we can each compute Ti as soon as the respective biconnected 
component < Vi,Tj > is reported by Hopcroft’s algorithm. □ 

Another opportunistic reduction eliminates nodes of degree 2 that do not 
belong to T. 

Theorem 2 Assum,e that node v ^ T has exactly two neighbors, vi and V 2 - 
Consider the graph transform, ation where edges {v,vi} and {v,V 2 } are replaced 
with edge {vi,V 2 } with weight w(v,v\) -\-w{v,V 2 )- Then this edge replacement 
defines a 1-1 correspondence between T -joins of the old graph and the new graph. 



Proof. The claim follows immediately from the observation that in the 
original instance, a solution either contains both ei and 62 , or neither of these 
edges. □ 

Because the running time of the most efficient algorithms for minimum weight 
perfect matching depends on the maximum edge weight (if we assume that all 
weights are integer), we should estimate how much this weight may change. The 
reduction implied by Theorem 1 does not change the maximum edge weight at 
all, while the reduction implied by Theorem 2 increases the maximum by a factor 
smaller than n. 

3.2 Reducing T-join to Perfect Matching with Gadgets 

In this subsection, we develop a new and more efficient reduction of the T- 
join problem to perfect matching, using gadgets. The general outline of our 
reduction is the following. For each instance {G,w,T) of the T-join problem we 
will construct an equivalent instance {G',w') of the perfect matching problem. 
Each node v will be replaced with a gadget graph Gy that is connected with 
edges of weight 0. Later we will call these edges connections. Each edge {u,v} 
will be replaced with 1, 2, or 3 edges that connect Gu with Gy. We will call these 
edges replacem,ents. Each replacement has the same weight as the corresponding 
original edge. 

From the previous subsection, we may assume the following restrictions on 
instances of the T-join problem: the graph is biconnected, \T\ is even and posi- 
tive, all T-nodes have degree at least 2 and all other nodes have degree at least 
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3. Henceforth, we will use S to denote V -T. The correctness of the translation 
is assured by the following strong equivalence condition. For each perfect match- 
ing M in G' there exists a solution A in G with 1-1 correspondence between 
replacements in M and edges in A where each edge e G A corresponds to one of 
its own replacements. Conversely, for every solution A in G there exist a perfect 
matching M in G' with such correspondence. 

We will assure the strong equivalence using the following lemma. 

Lemma 3 Properties (1), (2) and (3) are sufficient to assure strong equivalence 
between G and G : 

(1) For any edge {u,v}, there is a node which is incident to all replacem,ents of 
{u,v}. If this node belongs to Gy, then we say that {u,v} fans out from, v 
towards u. 

(2) If u G T then Gy contains an odd number of nodes, and if u £ S then Gu 
contains an even num>ber of nodes. 

(3) Let Au be a set of edges that are incident to som,e node u of the graph G. 
Assum.e that \Au\ has the sarn,e parity as \Gu\- Then all nodes of Gu are 
included in a m,atching Mu that consists only of the replacem,ents of the 
edges of Au and the connections of Gu- 

Proof. Given a matching M in G', we obtain a corresponding T-join A 
in G by discarding all connections, and exchanging the replacements for the 
“original” edges. Property (1) assures that there is a 1-1 correspondence between 
replacements in M and edges in A since only one edge from replacement can 
participate in a matching. Note that each replacement can match at most one 
node in Gu and connections in Gu can match only even number of nodes. Thus 
property (2) assures that if u £ T, then A contains an odd number of edges 
incident to u, and if ly. G 5, then A contains an even number of such edges. 

It remains to show a converse relationship. Consider a T-join A in G. 
By property (3), we can find a matching Mu for every group of edges Gu-> 
so it remains to show that we can combine these matchings together. By 
property (1), every edge {v,u} with more than one replacement has all its 
replacements incident to a single node; if this node is in Gy, then this edge 
fans out from v toward u. Let us remove from each Mu the replacements of 
edges that fan out from u and take the union of the reduced MuS. If a node 
w is not matched, it must belong to some Gu, and its incident edge from Mu 
was removed, because it was a replacement of an edge that fans out from 
u toward some v. However, in this case, {u,v} £ A and one of the replace- 
ments of this edge must still belong to My, moreover, it must be incident to w.Q 

Properties (1) and (2) can be immediately verified for a given construction of 
graph G'. Property (3) will be proven by induction on the degree of u. However, 
as a preliminary step, we must show that we can simultaneously provide a gadget 
Gy for every node v of G. The limitation is that almost every gadget requires 
that a certain number of edges adjacent to v fan out toward v. However, this 
requirement never exceeds half of the total number of incident edges (degree of 
the node). Thus, before proceeding further, we should show that 
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Lemma 4 We can fan out the edges of G in such a way that if a node has degree 
2k or 2k + 1, at least k edges are fanned out toward it (see Figure 2). 

Proof. By induction on number of edges in G. We first consider the case 
when G contains a simple cycle. Then we fan out the edges on this cycle so each 
is fanned out toward a different node, and remove the edges of this cycle. For 
each affected node, the degree decreases by 2, and the number of edges that fan 
toward such a node decreases by 1. 

If G contains no cycles, then it is a forest; we can take an edge that is incident 
to a leaf, fan it out toward its other end and remove this edge. Two nodes are 
affected: a leaf which does not have any requirements (A; == 0), and the other 
node, for which the degree decreased by 1, so that the requirement for more 
edges fanned out toward it decreased by 1 as well. □ 





Fig. 2. An example of graph transformation. In the original graph, node labels indicate 
the member of T and S respectively, arrows on edges indicate the direction of the 
possible fan out. The thick edges will be fanned out during the transformation. 



The gadgets that we use are formed from three kinds of building blocks. 
A gadget for node v is defined as the graph H that consists of G^^ plus the 
adjacent replacement edges. If the replaced edge fans out toward we keep all 
the replacements, and otherwise we keep only one. Because we define gadgets 
first, and assign them to various nodes later, we will use core{H) to denote Gy, 
and rind{H) to denote H — core{H). In general, a gadget H is characterized by 
the degree of its node and by the membership of this node in T or 5. We will use 
acronyms to identify gadgets, e.g., T4 are gadgets for elements of T that have 
degree 4. 

We can now formulate the sufficient conditions for the correctness of a gadget 
that are implied by Lemma 3. Property (1) of Lemma 3 is assured as follows: a 
Ti or Si gadget has \rind{H)\ = i\ each edge incident to the node represented 
by the gadget corresponds to one of the nodes of rind{H)\ if this edge fans out 
toward that node, this node is connected with core{H) by all its replacements, 
otherwise it is connected with core{H) by a single edge. 
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Property (2) is assured simply if \core{H)\ is odd for a T gadget, and even 
for an S gadget. Finally, property (3) means that if core{H) C U C H and \U\ 
is even, then H contains the matching that matches all nodes of U and no other 
nodes. 

Now we will describe construction of the gadgets. We first define three basic 
gadgets, 53, T3 and Q, which is actually a variety of T4 (see Fig. 3). Gadget 
T2 is a degenerate case, because we do not modify T-nodes of degree 2 (except 
than an edge originating in such node may fan out toward its other end). Fig. 3 
shows how we form gadgets for all nodes of degree below 7. For nodes of degree 
more than 6, the gadgets are constructed recursively using a procedure described 
below. 

Given two gadgets H and i7', we can w,eld them as follows. Let {x,y] be 
a replacement of an edge that does not fan out toward the node of i7, and let 
y G core{H). Select {x' ,y^} similarly. Then meld{H, H') is created by discarding 
X and x', and by identifying y with y\ Fig. 3 shows several examples of melding. 
For i > 7 we define Si as meld{S{i — 2),Q) and Ti as meld{T{i — 2), Q). The 
following lemma validates building larger gadgets by melding the smaller ones. 

Lemma 5 We can build new gadgets in the following three ways: 

(i) If H is an Si gadget, and H’ is an Sj, then meld{H, H') is a T{i + j — 2). 

(ii) If H is an Ti gadget, and is an Tj, then meld{H,H') is a T{i + j — 2). 

(in) If H is an Si gadget, and H’ is an Tj, then meld{H,H') is a S{i j — 2). 

Proof. Let Hq denote meld{H, H'). We will prove only (i), the other cases 

being similar. Property (1) of the gadget correctness is inherited from H and 
H', because edges that are fanning out in Hq were fanning out in H or H' and 
they are represented as before. One can also see that |rind(iLo)| = ^ j — 2, 
so Hq represents a node of degree i -\- j — 2. Property (2) follows quickly 
from the fact that |core(jftTo)| = \core{H)\ -f \core{H‘)\ — 1. To prove property 
(3), consider Uq such that core{Ho) C Uq C Hq, such that \Uq\ is even. Let 
U — Uor\rind{H) and = UoC\rind{H'). Because \core{Ho)\ is odd, |f/| + |[/'| 
is also odd. Without loss of generality assume that \H\ is even and \H'\ is 
odd. Because if is a correct Si gadget, the subgraph of core{H) U U contains 
a perfect matching. Now it remains to find the matching for core(H') U U' . 
We first obtain a matching for core(H') UlT U {x'}. We then remove the edge 
{x^,y‘}, because in H' node x‘ has degree 1, hence this edge must belong to 
our matching. Note that x^ was discarded during melding, and y' = y is already 
matched, so we have matched all the nodes of Uq. □ 

The following theorem estimates the quality of our gadget reduction of the 
T-join Problem to the Minimum Cost Perfect matching. 

Theorem 3 Consider an instance of Minim, um, Cost T-join problem, with n 
nodes, m edges and uq nodes of T that have degree 3. In linear tim,e we can 
generate a strongly equivalent instance of the Minim,u,m, Cost Perfect m,atching 
that has at m,ost 2m nodes and at m,ost 6m — 5n + 0.5no edges. 
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53 23 IT 54 = meW(53,23) • 

2 r4 = meW(53,53) 

55 = meld(S3,Q) S6 = meld(S3,Ti,S3,S3) 

T5 = meld{S3,T3,S3) T6 = meld(S3,Q,S3) 



Fig. 3. Replacing a vertex v with Gv Empty circles indicate the nodes of Gu^ solid 
circles indicate nodes that are connected to Gu via replacement edges. A large solid 
circle indicates an edge that fans out toward v. 



Proof. First, we reduce the problem to the case of a biconnected graph, thus 
eliminating the nodes of degree 0 and 1. Second, we decide for each edge the 
direction in which it can be fanned out. In the third stage, we replace each node 
V with its respective gadget Hy. Fourth, and finally, we connect the gadgets, 
making sure that if Hu assumes that the edge {u,v} fans out toward u, we 
allowed for that in the second stage. Fig. 2 illustrates the last three stages of 
this process. 

It is easy to see that a node of degree d is replaced with a gadget with d or 
d — 1 nodes, thus the total number of nodes in the new graph is bounded by the 
sum of node degrees in the original graph, i.e., 2m. The estimate of the resulting 
number of edges is less straightforward. We consider two classes of nodes: 
“original”, and “extras” - extra replacements and connections. Obviously, we 
have exactly m original edges. For the extras, one can check that for a node v 
of degree d < 5, Hy contains at most 2.5d — 5 extra edges, with the exception 
of T-nodes of degree 3, that have 3 = 2.5 x 3 - 5 -h 0.5 extra edges. Moreover, 
melding with Q increases d by 2, and the number of extra edges by 5. By adding 
these expressions for all nodes we obtain the claimed inequality. □ 



Obviously, the smaller the graph that is produced by our transformation, the 
less time we will need to run an algorithm for Minimum Cost Perfect matching. 
Can one show a set of provably minimal gadgets? We can answer this question 
partially, i.e., the number of nodes cannot be decreased in any of our gadgets. 
We also conjecture that our gadgets use the minimum number of extra edges as 
well. For degrees smaller than 6 we have verified this conjecture by an exhaustive 
case analysis. 
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Finally we can apply the best known so far algorithm by Gabow and Tarjan 
[6] to solve the perfect matching problem. 

Theorem 4 There exists an algorithm, that solves the Minim, um, Perturbation 
Problem, in tim,e 0({n\ogn)^^^)a{n), where a is the inverse of Ackerm,an func- 
tion. 



4 Computational Experience 

For the VLSI mask layout application, we have implemented several approaches, 
including the reduction to perfect matching using the gadgets we have described, 
in C+H- on a Unix platform. For solving the Perfect Matching problem, we have 
used the most recent and fastest implementation, due to Cook and Rohe [5]. Ta- 
ble 1 summarizes our computational experience with three layouts of different 
sizes and densities. All layouts were derived from industry standard-cell layouts. 
All runtimes are CPU seconds on a 300 MHz Sun Ultra-10 workstation with 
128MB RAM. We see that our code can handle very large flat designs in rea- 
sonable time, and is a promising basis for phase assignment in alternating PSM 
design, as well as other sparse instances of the T-join problem. Table 1 also 
confirms that our new exact method significantly improves over the previous 
methods of [15] [10]: it reduces by 40% the number of unresolved phase con- 
flicts, which correspondingly reduces the amount of layout modification needed 
in compaction. Finally, we also implemented the approximation algorithm for 
the T-join problem from [7]. Our results show that the average deviation from 
the optimum for the Goemans- Williamson algorithm is around 10%, which is 
significantly larger than the 2% for Euclidean matchings reported in [18]. 
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runtime 
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runtime 
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Abstract. We present simple, practical and efficient data structures 
for the fundamental problem of maintaining a resizable one-dimensional 
array, A[/../ -h n — 1], of fixed-size elements, as elements are added to 
or removed from one or both ends. Our structures also support access 
to the element in position i. All operations are performed in constant 
time. The extra space (i.e., the space used past storing the n current 
elements) is 0{y/n) at any point in time. This is shown to be within a 
constant factor of optimal, even if there are no constraints on the time. 
If desired, each memory block can be made to have size 2^ — c for a 
specified constant c, and hence the scheme works effectively with the 
buddy system. The data structures can be used to solve a variety of 
problems with optimal bounds on time and extra storage. These include 
stacks, queues, randomized queues, priority queues, and deques. 



1 Introduction 

The initial motivation for this research was a fundamental problem arising in 
many randomized algorithms [6,8,10]. Specifically, a randomized queue main- 
tains a collection of fixed-size elements, such as word-size integers or pointers, 
and supports the following operations: 

1. Insert (e): Add a new element e to the collection. 

2. DeleteRandom: Delete and return an element chosen uniformly at random 
from the collection. 

That is, if n is the current size of the set, DeleteRandom must choose each 
element with probability 1/n. We assume our random number generator returns 
a random integer between 1 and n in constant time. 

At first glance, this problem may seem rather trivial. However, it becomes 
more interesting after we impose several important restrictions. The first con- 
straint is that the data structure must be theoretically efficient: the operations 
should run in constant time, and the extra storage should be minimal. The sec- 
ond constraint is that the data structure must be practical: it should be simple 
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to implement, and perform well under a reasonable model of computation, e.g., 
when the memory is managed by the buddy system. The final constraint is more 
amusing and was posed by one of the authors: the data structure should be 
presentable at the first or second year undergraduate level in his text [10]. 

One natural implementation of randomized queues stores the elements in an 
array and uses the doubling technique [3]. Insert (e) simply adds e to the end of 
the array, increasing n. If the array is already full. Insert first resizes it to twice 
the size. DeleteRandom chooses a random integer between 1 and n, and retrieves 
the array element with that index. It then moves the last element of the array to 
replace that element, and decreases n, so that the first n elements in the array 
always contain the current collection. 

This data structure correctly implements the Insert and DeleteRandom op- 
erations. In particular, moving the last element to another index preserves the 
randomness of the elements chosen by DeleteRandom. Furthermore, both opera- 
tions run in 0(1) amortized time: the only part that takes more than constant 
time is the resizing of the array, which consists of allocating a new array of double 
the size, copying the elements over, and deallocating the old array. Because n/2 
new elements were added before this resizing occurred, we can charge the 0(n) 
cost to them, and achieve a constant amortized time bound. The idea is easily 
extended to permit shrinkage: simply halve the size of the structure whenever it 
drops to one third full. The amortization argument still goes through. 

The 0(n) space occupied by this structure is optimal up to a constant factor, 
but still too much. Granted, we require at least n units of space to store the 
collection of elements, but we do not require 4.5n units, which this data structure 
occupies while shrinkage is taking place. We want the extra space^ the space in 
excess of n units, to be within a constant factor of optimal, so we are looking 
for an n -h o(n) solution. 

1.1 Resizable Arrays 

This paper considers a generalization of the randomized queue problem to (one- 
dimensional) resizable arrays. A singly resizable array maintains a collection of 
n fixed-size elements, each assigned a unique index between 0 and n — 1, subject 
to the following operations: 

1. Read (i): Return the element with index i, 0 < i < n. 

2. Write (i, x): Set the element with index i to x, 0 < i < n. 

3. Grow: Increment n, creating a new element with index n. 

4. Shrink: Decrement n, discarding the element with index n — 1. 

As we will show, singly resizable arrays solve a variety of fundamental data- 
structure problems, including randomized queues as described above, stacks, 
priority queues, and indeed queues. In addition, many modern programming 
languages provide built-in abstract data types for resizable arrays. For example, 
the C-l-H- vector class [11, sec. 16.3] is such an ADT. 

Typical implementations of resizable arrays in modern programming systems 
use the “doubling” idea described above, growing resizable arrays by any con- 
stant factor c. This implementation has the major drawback that the amount 
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of wasted space is linear in n, which is unnecessary. Optimal space usage is es- 
sential in modern programming applications with many resizable arrays each of 
different size. For example, in a language such as C++, one might use compound 
data structures such as stacks of queues or priority queues of stacks that could 
involve all types of resizable structures of varying sizes. 

In this paper, we present an optimal data structure for singly resizable arrays. 
The worst-case running time of each operation is a small constant. The extra 

storage at any point in time is 0(y/n), which is shown to be optimal up to 

a constant factor.^ Furthermore, the algorithms are simple, and suitable for 
use in practical systems. While our exposition here is designed to prove the 
most general results possible, we believe that one could present one of the data 
structures (e.g., our original goal of the randomized queue) at the first or second 
year undergraduate level. 

A natural extension is the efficient implementation of a deque (or double- 
ended queue). This leads to the notion of a doubly resizable array which main- 
tains a collection of n fixed-size elements. Each element is assigned a unique 
index between i and u (where u - i 1 — n and are potentially negative), 
subject to the following operations: 

1. Read (i): Return the element with index i < i < u. 

2. Write (i, x): Set the element with index i to x, i < i < u. 

3. GrowForward: Increment u, creating a new element with index w + 1. 

4. ShrinkForward: Decrement u, discarding the element with index u. 

5. GrowBackward: Decrement I, creating a new element with index £ - 1. 

6. ShrinkBackward: Increment £, discarding the element with index 1. 

An extension to our method for singly resizable arrays supports this data type 
in the same optimal time and space bounds. 

The rest of this paper is outlined as follows. Section 2 describes our fairly 
realistic model for dynamic memory allocation. In Section 3, we present a lower 
bound on the required extra storage for resizable arrays. Section 4 presents our 
data structure for singly resizable arrays. Section 5 describes several applications 
of this result, namely optimal data structures for stacks, queues, randomized 
queues, and priority queues. Finally, Section 6 considers deques, which require 
us to look at a completely new data structure for doubly resizable arrays. 

2 Model 

Our model of computation is a fairly realistic mix of several popular models: a 
transdichotomous [4] random access machine in which memory is dynamically 
allocated. Our model is random access in the sense that any element in a block 
of memory can be accessed in constant time, given just the block pointer and 
an integer index into the block. Fredman and Willard [4] introduced the term 
transdichotomous to capture the notion of the problem size matching the ma- 
chine word size. That is, a word is large enough to store the problem size, and 

^ For simplicity of exposition, we ignore the case n = 0 in our bounds; the correct 
statement for a bound of 0(6) is the more tedious 0(1 +6). 
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SO has at least [log 2 (l + n)] bits (but not many more). In practice, it is usually 
the case that the word size is fixed but larger than log 2 M where M is the size of 
the memory (which is certainly at least n + 1). Our model of dynamic memory 
allocation matches that available in most current systems and languages, for 
example the standard C library. Three operations are provided: 

1. Allocate (s): Returns a new block of size s. 

2. Deallocate (B): Frees the space used by the given block B. 

3. Reallocate (B, s): If possible, resizes the block B to the specified size s. 
Otherwise, allocates a block of size s, into which it copies the contents of B, 
and deallocates B. In either case, the operation returns the resulting block 
of size s. 

Hence, in the worst case. Reallocate degenerates to an Allocate, a block copy, 
and a Deallocate. It may be more efficient in certain practical cases, but it offers 
no theoretical benefits. 

A memory block B consists of the user’s data, whose size we denote by |B|, 
plus a header of fixed size h. In many cases, it is desirable to have the total size 
of a block equal to a power of two, that is, have |B| = 2^ — h for some k. This 
is particularly important in the binary buddy system [6, vol. 1, p. 435], which 
otherwise rounds to the next power of two. If all the blocks contained user data 
whose size is a power of two, half of the space would be wasted. 

The amount of space occupied by a data structure is the sum of total block 
sizes, that is, it includes the space occupied by headers. Hence, to achieve o(n) 
extra storage, there must be o(n) allocated blocks. 

3 Lower Bound 

Theorem 1. f2{y/n) extra storage is necessary in the worst case for any data 
structure that supports inserting elements, and deleting those elements in some 
(arbitrary) order. In particular, this lower bound applies to resizable arrays, 
stacks, queues, randomized queues, priority queues, and deques. 

Proof. Consider the following sequence of operations: 

Insert (ui), . . . , Insert (un), Delete, . . . , Delete. 

V ^ . 

n times 

Apply the data structure to this sequence, separately for each value of n. Con- 
sider the state of the data structure between the inserts and the deletes: let /(n) 
be the size of the largest memory block, and let g{n) be the number of memory 
blocks. Because all the elements are about to be reported to the user (in an ar- 
bitrary order), the elements must be stored in memory. Hence, /(n) • g{n) must 
be at least n. 

At the time between the inserts and the deletes, the amount of extra storage 
is at least hg{n) to store the memory block headers, and hence the worst-case 
extra storage is at least g{n). Furthermore, at the time immediately after the 
block of size /(n) was allocated, the extra storage was at least f{n). Hence, the 
worst-case extra storage is at least max{f{n),g{n)}. Because f {n) • g{n) > n, 
the minimum worst-case extra storage is at least y/n. □ 
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This theorem also applies to the related problem of vectors in which elements 
can be inserted and deleted anywhere. Goodrich and Kloss [5] show that 0{y/n) 
amortized time suffices for updates, even when access queries must be performed 
in constant time. They use 0{y/n) extra space, which as we see is optimal. 

4 Singly Resizable Arrays 

The basic idea of our first data structure is storing the elements of the array in 
0{y/n) blocks, each of size roughly y/n. Now because n is changing over time, 
and we allocate the blocks one-by-one, the blocks have sizes ranging from 0(1) to 
0{y/n). One obvious choice is to give the ith block size z, thus having k{k -f l)/2 
elements in the first k blocks. The number of blocks required to store n elements, 
then, is |"(\/1 + 8n - l)/2] = 0(>/n). 

The problem with this choice of block sizes is the cost of finding a desired 
element in the collection. More precisely, the Read and Write operations must 
first determine which element in which block has the specified index, in what we 
call the Locate operation. With the block sizes above, computing which block 
contains the desired element i requires computing the square root of 1 + 8i. 
Newton’s method [9, pp. 274-292] is known to minimize the time for this, taking 
0(loglogz) time in the worst case. This prevents Read and Write from running 
in the desired 0(1) time bound. ^ 

Another approach, related to that of doubling, is to use a sequence of blocks 
of sizes the powers of 2, starting with 1. The obvious disadvantage of these sizes 
is that half the storage space is wasted when the last block is allocated and 
contains only one element. We notice however that the number of elements in 
the first k blocks is 2^ - 1, so the block containing element i is [lc>g2(l + ^)J- 
This is simply the position of the leading 1-bit in the binary representation of 
z + 1 and can be computed in 0(1) time (see Section 4.1). 

Our solution is to sidestep the disadvantages of each of the above two ap- 
proaches by combining them so that Read and Write can be performed in 0(1) 
time, but the amount of extra storage is at most 0{y/n). The basic idea is to 
have conceptual superblocks of size 2\ each split into approximately 2*/^ blocks 
of size approximately 2*/^ . Determining which superblock contains element z can 
be done in 0(1) time as described above. Actual allocation of space is by block, 
instead of by superblock, so only 0{y/n) storage is wasted at any time. 

This approach is described more thoroughly in the following sections. We 
begin in Section 4.1 with a description of the basic version of the data structure. 
Section 4.2 shows how to modify the algorithms to make most memory blocks 
have total size a power of two, including the size of the block headers. 

4.1 Basic Version 

The basic version of the data structure consists of two types of memory blocks: 
one index blocks and several data blocks. The index block simply contains pointers 
to all of the data blocks. The data blocks, denoted DBq, . . . , DBd-i, store all of 

^ In fact, one can use 0{y/n) storage for a lookup table to support const ant-time 
square-root computation, using ideas similar to those in Section 4.1. Here we develop 
a much cleaner algorithm. 
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Grow: 

1. If the last nonempty data block DBd-i is full: 

(a) If the last superblock SB3-1 is full: 

i. Increment s. 

ii. If s is odd, double the number of data blocks in a superblock. 

iii. Otherwise, double the number of elements in a data block. 

iv. Set the occupancy of SBs-i to empty. 

(b) If there are no empty data blocks: 

i. If the index block is full. Reallocate it to twice its current size. 

ii. Allocate a new last data block; store a pointer to it in the index block. 

(c) Increment d and the number of data blocks occupying SBs-i- 

(d) Set the occupancy of DBd-i to empty. 

2. Increment n and the number of elements occupying DBd-i^ 

Algorithm 1. Basic implementation of Grow. 

the elements in the resizable array. Data blocks are clustered into superblocks as 
follows: two data blocks are in the same super block precisely if they have the 
same size. Although superblocks have no physical manifestation, we will find it 
useful to talk about them with some notation, namely , SBg-i. When 

superblock SBk is fully allocated, it consists of data blocks, each of size 

2 r^/ 2 l Hence, there are a total of 2^ elements in superblock SBk. See Fig. 1. 



Index block 



Data blocks 




SBq SB I SB2 SBs—i 

Fig. 1. A generic snapshot of the basic data structure. 



We reduce the four resizable-array operations to three “fundamental” oper- 
ations as follows. Grow and Shrink are defined to be already fundamental; they 
are sufficiently different that we do not merge them into a single “resize” opera- 
tion. The other two operations. Read and Write, are implemented by a common 
operation Locate (i) which determines the location of the element with index i. 

The implementations of the three fundamental array operations are given in 
Algorithms 1-3. Basically, whenever the last data block becomes full, another 
one is allocated, unless an empty data block is already around. Allocating a 
data block may involve doubling the size of the index block. Whenever two data 
blocks become empty, the younger one is deallocated; and whenever the index 
block becomes less than a quarter full, it is halved in size. To find the block 
containing a specified element, we find the superblock containing it by computing 
the leading 1-bit, then the appropriate data block within the superblock, and 
finally the element within that data block. 

Note that the data structure also has a constant-size block, which stores the 
number of elements (n), the number of superblocks (s), the number of nonempty 
data blocks (d), the number of empty data blocks (which is always 0 or 1), and 
the size and occupancy of the last nonempty data block, the last superblock, 
and the index block. 
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Shrink: 

1. Decrement n and the number of elements occupying the last nonempty 
data block DBd-i- 

2. If DBd-i is empty: 

(a) If there is another empty data block, Deallocate it. 

(b) If the index block is a quarter full, Reallocate it to half its size. 

(c) Decrement d and the number of data blocks occupying the last 
superblock SBs-i> 

(d) If SBs-i is empty: 

i. Decrement s. 

ii. If 5 is even, halve the number of data blocks in a superblock. 

iii. Otherwise, halve the number of elements in a data block. 

iv. Set the occupancy of SBs-i to full. 

(e) Set the occupancy of DBd-i to full. 

Algorithm 2. Basic implementation of Shrink. 



Locate (i): 

1. Let r denote the binary representation of i + 1, with all leading zeros removed. 

2. Note that the desired element i is element e of data block h of superblock 
where 

(a) k = \r\ — 1, 

(b) b is the [k/2\ bits of r immediately after the leading 1-bit, and 

(c) e is the last \k/2~\ bits of r. 

3. Let p = 2^ — 1 be the number of data blocks in superblocks prior to SBk^ 

4. Return the location of element e in data block DBp^t^ 

Algorithm 3. Basic implementation of Locate. 

In the rest of this section, we show the following theorem: 

Theorem 2. This data structure implements singly resizable arrays using 
0{y/n) extra storage in the worst case and 0(1) time per operation, on a ran- 
dom access machine where memory is dynamically allocated, and binary shift by 
k takes 0(1) time on a word of size |’log 2 (l + n)] . Furthermore, if Allocate or 
Deallocate is called when n = no, then the next call to Allocate or Deallocate will 
occur after ^2{^/n^) operations. 

The space bound follows from the following lemmas. See [2] for proofs. 
Lemma 1. The number of superblocks (s) is [log 2 (l + n )] . 

Lemma 2. At any point in time, the number of data blocks is 0{y/n). 

Lemma 3. The last (empty or nonempty) data block has size 0{y/n). 

To prove the time bound, we first show a bound of 0(1) for Locate, and then 
show how to implement Reallocate first in 0(1) amortized time and then in 0(1) 
worst-case time. 

The key issue in performing Locate is the determination of fc = [^og 2 (l + ^)1 , 
the position of the leading 1-bit in the binary representation of z + 1. Many 
modern machines include this instruction. Newer Pentium chips do it as quickly 
as an integer addition. Brodnik [1] gives a constant-time method using only 
basic arithmetic and bitwise boolean operators. Another very simple method is 
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to store all solutions of “half-length,” that is for values of i up to 2 L(^°S 2 (i+^))/ 2 J — 
0{y/n). Two probes into this lookup table now suffice. We check for the leading 
1-bit in the first half of the 1 -h Llog 2 (l + ^)J bit representation of i, and if there 
is no 1-bit, check the trailing bits. The lookup table is easily maintained as n 
changes. From this we see that Algorithm 3 runs in constant time. 

We now have an 0(1) time bound if we can ignore the cost of dynamic 
memory allocation. First let us show that Allocate and Deallocate are only called 
once every operations as claimed in Theorem 2. Note that immediately 

after allocating or deallocating a data block, the number of unused elements in 
data blocks is the size of the last data block. Because we only deallocate a data 
block after two are empty, we must have called Shrink at least as many times as 
the size of the remaining empty block, which is f2{y/n) by Lemma 3. Because we 
only allocate a data block after the last one becomes full, we must have called 
Grow at least as many times as the size of the full block, which again is i?(\/n). 

Thus, the only remaining cost to consider is that of resizing the index block 
and the lookup table (if we use one), as well as maintaining the contents of 
the lookup table. These resizes only occur after fi{y/n) data blocks have been 
allocated or deallocated, each of which (as we have shown) only occurs after 
f2{y/n) updates to the data structure. Hence, the cost of resizing the index block 
and maintaining the lookup table, which is 0(n), can be amortized over these 
updates, so we have an 0(1) amortized time bound. 

One can achieve a worst-case running time of 0(1) per operation as follows. 
In addition to the normal index block, maintain two other blocks, one of twice 
the size and the other of half the size, as well as two counters indicating how 
many elements from the index block have been copied over to each of these 
blocks. In allocating a new data block and storing a pointer to it in the index 
block, also copy the next two uncopied pointers (if there are any) from the index 
block into the double-size block. In deallocating a data block and removing the 
pointer to it, also copy the next two uncopied pointers (if there are any) from 
the index block into the half-sized block. 

Now when the index block becomes full, all of the pointers from the index 
block have been copied over to the double-size block. Hence, we Deallocate the 
half-size block, replace the half-size block with the index block, replace the index 
block with the double-size block, and Allocate a new double-size block. When 
the index block becomes a quarter full, all of the pointers from the index block 
have been copied over to the half-size block. Hence, we Deallocate the double-size 
block, replace the double-size block with the index block, replace the index block 
with the half-size block, and Allocate a new half-size block. 

The maintenance of the lookup table can be done in a similar way. The only 
difference is that whenever we allocate a new data block and store a pointer to 
it in the index block, in addition to copying the next two uncopied elements (if 
there are any), compute the next two uncomputed elements in the table. Note 
that the computation is done trivially, by monitoring when the answer changes, 
that is, when the question doubles. Note also that this method only adds a 
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constant factor to the extra storage, so it is still 0{y/n). The time per operation 
is therefore 0(1) in the worst case. 

4.2 The Buddy System 

In the basic data structure described so far, the data blocks have user data of 
size a power of two. Because some memory management systems add a block 
header of fixed size, say h, the total size of each block can be slightly more than a 
power of two {2^-\-h for some k) . This is inappropriate for a memory management 
system that prefers blocks of total size a power of two. For example, the (binary) 
buddy system [6, vol. 1, p. 540] rounds the total block size to the next power 
of two, so the basic data structure would use twice as much storage as required, 
instead of the desired 0{y/n) extra storage. While the buddy system is rarely 
used exclusively, most UNIX operating systems (e.g., BSD [7, pp. 128-132]) use 
it for small block sizes, and allocate in multiples of the page size (which is also 
a power of two) for larger block sizes. Therefore, creating blocks of total size 
a power of two produces substantial savings on current computer architectures, 
especially for small values of n. 

This section describes how to solve this problem by making the size of the user 
data in every data block equal to2^ -h for some k. As far as we know, this is the 
first theoretical algorithm designed to work effectively with the buddy system. 
To preserve the ease of finding the superblock containing element number i, we 
still want to make the total number of elements in superblock SBk equal to 2^. 
To do this, we introduce a new type of block called an overflow block. There will 
be precisely one overflow block OBk per superblock SBk. This overflow block is 
of size , and hence any waste from using the buddy system is 0(\/fc). 

Conceptually, the overflow block stores the last h elements of each data block 
in the superblock. We refer to a data block DBi together with the corresponding h 
elements in the overflow block as a conceptual block CB{. Hence, each conceptual 
block in superblock SBk has size , as did the data blocks in the basic data 
structure. 

We now must maintain two index blocks: the data index block stores pointers 
to all the data blocks as before, and the overflow index block stores pointers to 
all the overflow blocks. As before, we double the size of an index block whenever 
it becomes full, and halve its size whenever it becomes a quarter full. 

The algorithms for the three fundamental operations are given in Algo- 
rithms 4-6. They are similar to the previous algorithms; the only changes are as 
follows. Whenever we want to insert or access an element in a conceptual block, 
we first check whether the index is in the last h possible values. If so, we use the 
corresponding region of the overflow block, and otherwise we use the data block 
as before. The only other difference is that whenever we change the number of 
superblocks, we may allocate or deallocate an overflow block, and potentially 
resize the overflow index block. 

We obtain an amortized or worst-case 0(1) time bound as before. It remains 
to show that the extra storage is still 0(>/n). The number s of overflow blocks 
is O(logn) by Lemma 1, so the block headers from the overflow blocks are 
sufficiently small. Only the last overflow block may not be full of elements; 
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Grow: 

1. If the last nonempty conceptual block CBd-i is full: 

(a) If the last superblock SBa-\ is full: 

i. Increment s. 

ii. If s is odd, double the number of data blocks in a superblock. 

iii. Otherwise, double the number of elements in a conceptual block. 

iv. Set the occupancy of SBs-\ to empty. 

V. If there are no empty overflow blocks: 

— If the overflow index block is full. Reallocate it to twice 
its current size. 

- Allocate a new last overflow block, and store a pointer to it in 
the overflow index block. 

(b) If there are no empty data blocks: 

i. If the data index block is full. Reallocate it to twice its current size. 

ii. Allocate a new last data block, and store a pointer to it in 
the data index block. 

(c) Increment d and the number of data blocks occupying SBs-i> 

(d) Set the occupancy of CBd-i to empty. 

2. Increment n and the number of elements occupying CBd-i- 

Algorithm 4. Buddy implementation of Grow. 

its size is at most h times the size of the last data block, which is 0{y/n) by 
Lemma 3. The overflow index block is at most the size of the data index block, so 
it is within the bound. Finally, note that the blocks whose sizes are not powers 
of two (the overflow blocks and the index blocks) have a total size of 0{y/n), 
so doubling their size does not affect the extra storage bound. Hence, we have 
proved the following theorem. 

Theorem 3. This data structure implements singly resizable arrays in 0{y/n) 
worst-case extra storage and 0(1) time per operation, on a |’log 2 (l + n)] bit 
word random access machine where memory is dynamically allocated in blocks 
of total size a power of two, and binary shift by k takes 0(1) time. Furthermore, 
if Allocate or Deallocate is called when n = uq, then the next call to Allocate or 
Deallocate will occur after f2{y/no) operations. 

5 Applications of Singly Resizable Arrays 

This section presents a variety of fundamental abstract data types that are 
solved optimally (with respect to time and worst-case extra storage) by the data 
structure for singly resizable arrays described in the previous section. Please 
refer to [2] for details of the algorithms. 

Corollary 1. Stacks can be implemented in 0(1) worst-case time per operation, 
and 0{y/n) worst-case extra storage. 

Furthermore, the general Locate operation can be avoided by using the point- 
er to the last element in the array. Thus the computation of the leading 1-bit 
is not needed. This result can also be shown or the following data structure by 
keeping an additional pointer to an element in the middle of the array [2]. 
Corollary 2. Queues can be implemented in 0(1) worst-case time per opera- 
tion, and 0{y/n) worst- case extra storage. 
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Shrink: 

1. Decrement n and the number of elements occupying CBd-i> 

2. If CBd-i is empty: 

(a) If there is another empty data block, Deallocate it. 

(b) If the data index block is a quarter full, Reallocate it to half its size. 

(c) Decrement d and the number of data blocks occupying the last 
superblock SBs-i- 

(d) If SBs-i is empty: 

i. If there is another empty overflow block. Deallocate it. 

ii. If the overflow index block is a quarter full. Reallocate it to half its size, 
hi. Decrement s. 

iv. If s is even, halve the number of data blocks in a superblock. 

V. Otherwise, halve the number of elements in a conceptual block, 
vi. Set the occupancy of SBg-i to full. 

(e) Set the occupancy of DBd-i to full. 

Algorithm 5. Buddy implementation of Shrink. 



Locate (i): 

1. Let r denote the binary representation of i + 1, with all leading zeros removed. 

2. Note that the desired element i is element e of conceptual block b of 
superblock fc, where 

(a) k = \r\ — 1, 

(b) b is the [k/2\ bits of r immediately after the leading 1-bit, and 

(c) e is the last [/c/2] bits of r. 

3. Let j = be the number of elements in conceptual block 6. 

4. If e > j — h, element i is stored in an overflow block: 

Return the location of element bh e — {j — h) in overflow block OBk> 

5. Otherwise, element i is stored in a data block: 

(a) Let p = 2^ — 1 be the number of data blocks in superblocks prior to SBk^ 

(b) Return the location of element e in data block DBp^b- 

Algorithm 6. Buddy implementation of Locate. 

Corollary 3. Randomized queues can be implemented in 0{y/n) worst-case ex- 
tra storage, where Insert takes 0(1) worst-case time, and DeleteRandom takes 
time dominated by the cost of computing a random number between 1 and n. 

Corollary 4. Priority queues can be implemented in O(logn) worst-case time 
per operation, and 0{y/n) worst-case extra storage. 

Corollary 5. Double-ended priority queues (which support both DeleteMin and 
DeleteMax) can be implemented in O(logn) worst-case time per operation, and 
0{y/n) worst- case extra storage. 

6 Doubly Resizable Arrays and Deques 

A natural extension to our results on optimal stacks and queues would be to 
support deques (double-ended queues). It is easy to achieve an amortized time 
bound by storing the queue in two stacks, and flipping half of one stack when 
the other becomes empty. To obtain a worst-case time bound, we use a new data 
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structure that keeps the blocks all roughly the same size (within a factor of 2). 
By dynamically resizing blocks, we show the following result; see [2] for details. 
Theorem 4. A doubly resizable array can be implemented using 0{y/n) extra 
storage in the worst case and 0(1) time per operation, on a trans dichotomous 
random access machine where memory is dynamically allocated. 

Note that this data structure avoids finding the leading 1-bit in the binary 
representation of an integer. Thus, in some cases (e.g., when the machine does 
not have an instruction finding the leading 1-bit), this data structure may be 
preferable even for singly resizable arrays. 

7 Conclusion 

We have presented data structures for the fundamental problems of singly and 
doubly resizable arrays that are optimal in time and worst-case extra space on re- 
alistic machine models. We believe that these are the first theoretical algorithms 
designed to work in conjunction with the buddy system, which is practical for 
many modern operating systems including UNIX. They have led to optimal data 
structures for stacks, queues, priority queues, randomized queues, and deques. 

Resizing has traditionally been explored in the context of hash tables [3]. 
Knuth traces the idea back at least to Hopgood in 1968 [6, vol. 3, p. 540]. An 
interesting open question is whether it is possible to implement dynamic hash 
tables with o(n) extra space. 

We stress that our work has focused on making simple, practical algorithms. 
One of our goals is for these ideas to be incorporated into the C-h-h standard 
template library (STL). We leave the task of expressing the randomized queue 
procedure in a form suitable for first year undergraduates as an exercise for the 
fifth author. 
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Abstract. A new way of constructing (minimal) perfect hash functions 
is described. The technique considerably reduces the overhead associ- 
ated with resolving buckets in two-level hashing schemes. Two memory 
probes suffice for evaluation of the function. This improves the probe 
performance of previous minimal perfect hashing schemes, and is shown 
to be optimal. 



1 Introduction 

This paper deals with classes of hash functions which are perfect for the n-subsets 
of the finite universe 17 = {0, . . . , u — 1}. For any S G (^) - the subsets of U of 
size n - a perfect class contains a function which is 1-1 on S (“perfect” for 5). 
We consider perfect classes with range {0, . . . , a - 1}. 

A perfect class of hash functions can be used to construct static dictionaries. 
The attractiveness of using perfect hash functions for this purpose depends on 
several characteristics of the class. 1. Efficiency of evaluation, in terms of com- 
putation and the number of probes into the description. 2. Time needed to find 
a perfect function in the class. 3. Size of the range of functions compared to the 
minimum, n. 4. Space required to store a function. 

Fredman, Komlos and Szemeredi [7] showed that it is possible to construct 
space efficient perfect hash functions with range a = 0{n) which can be eval- 
uated in constant time. Their model of computation (which we adopt here) is 
a word RAM with unit cost arithmetic operations and memory lookups, where 
an element of U fits into one machine word. More precisely, evaluation of their 
function requires evaluation of two universal hash functions and two probes into 
the function description. Using one additional probe, the function can be made 
minimal, i.e. with range n. The hash function can be constructed in expected 
time 0{n) by a randomized algorithm, and the function description occupies 
0(n) machine words. More information on the development of perfect hashing 
can be found in the survey of Czech, Havas and Majewski [3]. 
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project number 20244 (ALCOM-IT) 

Basic Research in Computer Science, 

Centre of the Danish National Research Foundation. 




50 



This paper presents a simple perfect class of hash functions which matches 
the best results regarding 2 and 3 above, and at the same time improves upon 
the best known efficiency of evaluation. The space usage is competitive with 
similar schemes. Altogether, the class is believed to be attractive in practice. 

The class of hash functions presented here can be seen as a variation of an 
early perfect class of hash functions due to Tarjan and Yao [10]. Their “single 
displacement” class requires a universe of size u = O(n^). The idea is to split 
the universe into blocks of size 0(n), each of which is assigned a “displacement” 
value. The zth element within the jth block is mapped to i + dj, where dj is the 
displacement value of block j. A range of size 0{n) is possible if a certain “har- 
monic decay” condition on the set S holds. To achieve harmonic decay, Tarjan 
and Yao perform a suitable permutation of the universe. The central observa- 
tion of this paper is that a reduction of the universe to size O(n^), as well as 
harmonic decay, can be achieved using universal hash functions. Or equivalently, 
that buckets in a (universal) hash table can be resolved using displacements. 

2 A Perfect Class of Hash Functions 

The concept of universality [2] plays an important role in the analysis of our 
class. We use the following notation. 

Definition 1. A class of functions Hr = {hi, . . . , hk}, hi :U {0, . . . , r — 1}, 
is c-universal if for any x,y £ U , x ^ y, PTi[hi{x) = hi{y)] <c/r. 

Many such classes with constant c are known, see e.g. [4]. For our application 
the important thing to note is that there are universal classes that allow effi- 
cient storage and evaluation of their functions. More specifically, O(logu) bits of 
storage suffice, and a constant number of simple arithmetic and bit operations 
are enough to evaluate the functions. Furthermore, c is typically in the range 
1 < c < 2. 

The second ingredient in the class definition is a displacement function. Gen- 
eralizing Tarjan and Yao’s use of integer addition, we allow the use of any group 
structure on the blocks. It is assumed that the elements {0,...,a— 1} of a block 
correspond to (distinct) elements in a group (G, ffl,e), such that group opera- 
tions may be performed on them. We assume the group operation and element 
inversion to be computable in constant time. A displacement function is a func- 
tion of the form x i-> x ffl d for d G G. The element d is called the displacement 
value of the function. We are ready to define the class. 

Definition 2. Let (G,ffl,e) be a group, D C G a set of displacement values, 
and let Ha and Hi, be Cf- and Cg -universal, respectively. We define the following 
class of functions from U to G: 

H{^,D,a,b) == [h \ h{x) = f{x) ffld^(^),/ £ Ha, g £ Ht,di £ D} . 

Evaluation of a function in 'H{^,D,a,b) consists of using a displacement func- 
tion, determined by g{x), on the value /(x). In terms of the Tarjan- Yao scheme. 
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g assigns a block number to each element of [/, and / determines its number 
within the block. The next section will show that when a > CfujA and b > 2cgn, 
it is possible to find /, g and displacement values such that the resulting function 
is perfect. The class requires storage of b displacement values. Focusing on in- 
stances with a reasonable memory usage, we from now on assume that b = 0{n) 
and that elements of D can be stored in one machine word. The range of func- 
tions in D,a,b) is {x ffl d | x G {0, . . . , a - 1}, d G D}. The size of this set 
depends on the group in question and the set D. Since our interest is in functions 
with range n (or at most 0{n)), we assume a < n. For properties of randomly 
chosen functions of a class similar to 6), see [5, Sect. 4]. 

2.1 Analysis 

This section gives a constructive (but randomized) method for finding perfect 
hash functions in the class 7^(ffl, D, a, b) for suitable D, a and b. The key to the 
existence of proper displacement values is a certain “goodness” condition on / 
and g. Let B{g,i) = {a: G 5 | g{x) = i] denote the elements in the ith block 
given by g. 

Definition 3. Let r >1. A pair (f,g) ^ Ha x Hi, is r-good (for S) if 

1. The function x {f(x),g{x)) is 1-1 on S, and 
Ei,|B(9,i)|>i < n/r . 

The first condition says that / and g successfully reduce the universe to size ab 
(clearly, if {f{xi),g{xi)) = {f{x 2 ),g{x 2 )) then regardless of displacement values, 
Xi and X 2 collide). The second condition implies the harmonic decay condition 
of Tarjan and Yao (however, it is not necessary to know their condition to un- 
derstand what follows). We show a technical lemma, estimating the probability 
of randomly finding an r-good pair of hash functions. 

Lemma 4. Assume a > Cfu/Ar, b > 2cgrn. For any S G (^), randomly chosen 
{f^g) E Ha X Ht is r-good for S with probability > (1 - ^^|^)(1 — ^) > 0. 

Proof. By -universality, the expected value for the sum (l^^|’di) — 

2 

X{„,t,}e(f),p(„)=j(^) 1 is bounded by (”)^ < so applying Markov’s in- 
equality, the sum has value less than n/4r with probability > 1 — Since 

\Big,i)\^ < for|5(i^,i)| > 1, we then have that ^;i,|B(p,i)|>i < 

n/r. Given a function g such that these inequalities hold, we would like to bound 
the probability that x {f{x),g{x)) is 1-1 on S. By reasoning similar to before, 
we get that for randomly chosen / the expected number of collisions among 
elements of B{g,i) at most Summing over i we get the expected 

total number of collisions to be less than CfujAra. Hence there is no collision 
with probability more than 1 - By the law of conditional probabilities, the 
probability of fulfilling both r-goodness conditions can be found by multiplying 
the probabilities found. □ 
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We now show that r-goodness is sufficient for displacement values to exist, 
by means of a constructive argument in the style of [10, Theorem 1]: 

Theorem 5. Let (/, g) e Ha x Ht be r-good for S £ r > 1, and let \D\ > n. 
Then there exist do, • • • , db-i £ D, such that x f{x) ffl dg(^x) 'Is 1-1 on S. 

Proof. Note that displacement di is used on the elements {f{x) \ x £ B{g,i)}^ 
and that any h £ D, a, b) is 1-1 on each B{g, i). We will assign the displace- 
ment values one by one, in non-increasing order of \B{g^i)\. At the kth step, we 
will have displaced the A: — 1 largest sets, {B{g,i) \ i £ I}, |/| = fc — 1, and want to 
displace the set B{g,j) such that no collision with previously displaced elements 
occurs. If \B{g,j)\ < 1 this is trivial since \D\ > n, so we assume \B{g,j)\ > 1. 
The following claim finishes the proof: 

Claim. If \B{g^j)\ > 1, then with positive probability, namely more than 1 — 
a randomly chosen d £ D displaces B{g,j) with no collision. 

Proof. The set {f{x)~'^ ffl f{y) ^ di \ x £ B{gJ),y £ B{g,i),i £ 1} con- 
tains the unavailable displacement values. It has size < \B{g,j)\ ^)l ^ 

Ei,iB(p,i)|>i \B{g,iW < using first non-increasing order, \B{gJ)\ > 1, and 
then r-goodness. Hence there must be more than (1 — ^)\D\ good displacement 
values in D. □ 

Lemma 4 implies that when a > + e)n and b > (2c^r-fe)n, an r-good pair 

of hash functions can be found (and verified to be r-good) in expected time 0(n). 
The proof of Theorem 5, and in particular the claim, shows that for a 1 + e-good 
pair (/, 5^), the strategy of trying random displacement values in D successfully 
displaces all blocks with more than one element in expected 1/e attempts per 
block. When 0{n) random elements of D can be picked in 0(n) time, this runs 
in expected time 0{nle) = 0{n). Finally, if displacing all blocks with only one 
element is easy, the whole construction runs in expected time 0(n). 



2.2 Instances of the Class 

We defined our class at a rather abstract level, so let us look at some specific 
instances. For simplicity, we use universal classes with constant 1. The number 
of displacement values, 6, must be at least (2 + e)n, e > 0, for expected linear 
time construction. Note that it is possible to “pack” all displacement values into 
b\\ogn] bits. 

Addition Using the integers with addition and D = {0, . . . , n - 1}, we get the 
class Hz = {h \ h{x) = f{x) + dg(^x)J ^ Hn/ 4^9 £ < di < n}. The 

range of hash functions in the class is {0, . . . , |n - 2}, so it is not minimal. 
Addition Modulo n The previous class can be made minimal at the expense 
of a computationally more expensive group operator, addition modulo n: 
Hzn = {h I h{x) = {f{x) + dgi^x)) mod n, / G Hn,g £ Hb,0 < di < n}. Note 
that since the argument to the modulo n operation is less than 2n, it can 
be implemented using one comparison and one subtraction. Pseudo-code for 
the construction algorithm of this class can be found in [9]. 
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Bitwise Exclusive Or The set of bit strings of length i = [logn] form the 
group Z 2 under the operation of bitwise exclusive or, denoted by ©. We let 
{0, . . . , n - 1} correspond to ^-bit strings of their binary representation, and 
get == {h I h{x) = /(^)0d^(x)?/ ^ ^ 2 ^- 1 5 P ^ H^^di G {0,1}^}. The 
range of functions in this class is (corresponds to) the numbers {0, . . . , 2^ — 1}. 
It is thus only minimal when n is a power of two. Using more displacement 
values the class can be made minimal, see [9]. 

2.3 Using Only One Multiplication 

A variant of the scheme presented here uses only one universal hash function 
and thus offers efficiency of evaluation close to the lowest one conceivable, see 
the lower bound in [1]. The idea is to pick a randomly chosen function from a 
strongly universal class of hash functions with range The two 

functions needed are obtained by composing with functions picking out the log a 
most significant and the log 6 least significant bits. An analysis similar to that 
in Lemma 4 shows that the pair of functions constructed in this way is r-good 
with probability more than 1 - The number of displacement values 

needed for a = n is therefore (y -f e)n. Details can be found in [9]. 

Using the strongly universal class of Dietzfelbinger [4], one multiplication, a 
few additions and some simple bit operations suffices for the evaluation of the 
resulting perfect hash function. 

3 Proof of Probe Optimality 

This section serves to prove that one cannot in general do better than two probes 
into the description of a perfect hash function. Of course, for large word length, 
w, one probe does suffice: If in > flogn] + \u/b] then a bitmap of the whole set, 
as well as rank information in each word, can be put into b words, and one-probe 
perfect minimal hashing is easy. This bound is close to optimal. 

Theorem 6. Let H - {/ii, . . . , hi : U {0,...,a - 1}, where u > 2a, 
be a class of perfect hash functions for (^), n >2. Assume that the functions 
can be described using b words of size w > logu, and can be evaluated using one 
word-probe. Then w > — 1, where c > 0 is a constant. 

Proof. Denote by Ui the subset of U for which word i is probed. We will 
choose B C {0, . . . , fo - 1} such that Ub — ^iesUi has size at least 2a. By the 
pigeon hole principle, B can be chosen to have size \^] • The crucial observation 
is that the words given by B must contain enough information to describe a 
perfect hash function for any n-subset of Ub- By [8, Theorem III. 2. 3. 6] such a 
description requires at least ~ 2 \Ub \\^2 ~ ^ ^ ~ ^ Therefore 

(1 H- > \B\w > ~ which the stated bound follows. □ 

Corollary 7. In the setting of Theorem 6, if a — 0{n) then b = Q{u/w) words 
are necessary. 
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4 Conclusion and Open Problems 

We have seen that displacements, together with universal hash functions, form 
the basis of very efficient (minimal) perfect hashing schemes. The number of 
memory probes needed for evaluation is optimal, and the variant sketched in 
Sect. 2.3 needs only one “expensive” arithmetic operation. 

The space consumption in bits, although quite competitive with that of other 
evaluation-efficient schemes, is a factor of 0(logn) from the theoretical lower 
bound [8, Theorem III. 2. 3. 6]. Some experimental studies [6] suggest that the 
space consumption for this kind of scheme may be brought close to the optimum 
by simply having fewer displacement values. It would be interesting to extend the 
results of this paper in that direction: Can low space consumption be achieved 
in the worst case, or just in the average case? Which randomness properties 
of the hash functions are needed to lower the number of displacement values? 
Answering such questions may help to further bring together the theory and 
practice of perfect hashing. 
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Abstract. Shared-memory multiprocessors feature parallelism and a 
steep cache hierarchy, two saHent characteristics that distinguish them 
from the commodity processors of ten years ago. Both of these char- 
acteristics can be exploited effectively using the same general strategy: 
divide-and-conquer recursion. This talk overviews the Cilk multithreaded 
programming language being developed in the MIT Laboratory for Com- 
puter Science, which allows a programmer to exploit paraUehsm through 
divide-and-conquer. In addition, I show how divide-and-conquer allows 
caches to be used effectively. 

In the first part of my talk, I introduce the Cilk programming language. 
Cilk minimally extends the C programming language to allow interac- 
tions among computational threads to be specified in a simple and high- 
level fashion. Cilk’s provably efficient runtime system dynamically maps 
a user’s program onto available physical resources, freeing the program- 
mer from concerns of communication protocols and load balancing. In 
addition, Cilk provides an abstract performance model that a program- 
mer can use to predict the multiprocessor performance of his application 
from its execution on a single processor. Not only do Cilk programs scale 
up to run efficiently on multiple processors, but unlike existing parallel- 
programming environments, such as MPI and HPF, Cilk programs “scale 
down”: the efficiency of a Cilk program on one processor rivals that of 
a comparable C program. I illustrate Cilk programming through the ex- 
ample of divide-and-conquer matrix multiplication. 

The second part of my talk presents a strategy for designing algorithms to 
exploit multiple levels of caching effectively. Unlike previous algorithms, 
these algorithms are ’’cache oblivious”: no variables dependent on hard- 
ware parameters, such as cache size and cache-line length, need to be 
tuned to achieve optimafity. Nevertheless, I show that cache-oblivious 
algorithms can be developed that use an optimal amount of work and 
move data optimally among multiple levels of cache. Problems that can 
be solved efficiently by cache-oblivious algorithms include matrix multi- 
plication, FFT, sorting, and matrix transpose, all of which are solved in 
a divide-and-conquer fashion. 

Together, these two technologies provide a foundation for the design and 
analysis of efficient algorithms for shared-memory multiprocessors. 
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Abstract. We consider three closely related optimization problems, aris- 
ing from the graph drawing and the VLSI research areas, and conjectured 
to be NP-hard, and we prove that, in fact, they are NP-complete. Start- 
ing from an orthogonal representation of a graph, i.e., a description of 
the shape of the edges that does not specify segment lengths or ver- 
tex positions, the three problems consist of providing an orthogonal grid 
drawing of it, while minimizing the area, the total edge length, or the 
maximum edge length, respectively. 



1 Introduction 

The orthogonal drawing standard is recognized to be suitable for several types 
of diagrams, including data flow diagrams, entity-relationship diagrams, state- 
transition charts, circuit schematics, and many others. Such diagrams are ex- 
tensively used in real-life applications spanning from software engineering, to 
databases, real-time systems, and VLSI. 

A well known approach to produce orthogonal drawings is the topology- 
shape-metric approach (see, for example, [14, 5, 1, 9, 3]), in which the graph draw- 
ing process is organized in three steps: 

i) the planarization step determines the topology of the drawing, described 
by its planar embedding, i.e., the order of the edges around each vertex. The 
purpose of this step is to minimize edge crossings. 

ii) the orthogonalization step determines the shape of the drawing: each 
edge is equipped with a list of angles, describing the bends featured by the 
orthogonal line representing the edge in the flnal drawing. The purpose of this 
step is the minimization of the total number of bends. 

iii) the compaction step determines the flnal coordinates of vertices and 
bends. The name of this last step originates from the fact that an aesthetic 
measure between area, total edge length, or maximum edge length is hopefully 
minimized. 

The compaction problem is precisely the optimization problem consisting 
of minimizing one of the three measures just mentioned, while performing the 

* Research supported in part by the CNR Project “Geometria Computazionale Ro- 
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compaction step: in particular we call Orthogonal Area Compaction (OAC), Or- 
thogonal Total Edge Length Compaction (OTELC), and Orthogonal Maximum 
Edge Length Compaction (OMELC) the three problems, respectively. 

Finding the intrinsic computational complexity of the compaction problem 
has been for a long time an elusive goal. Decades of intense research in the field 
of orthogonal graph drawing have not affected our knowledge in this respect: 
the problem is mentioned as open in foundating papers as in recent ones ([15,8, 
10]). As far as we know, the only contribution to this subject is the early result 
contained in [4], where the trivial case of not connected graphs is demonstrated 
to be NP-complete. 

The compaction problem has been one of the challenging tasks in the VLSI 
research field too, where the requirement of minimizing the size of a circuit 
layout while preserving its shape, led to formulations similar to those arising 
in the graph drawing area, although, for VLSI purposes, vertices are possibly 
replaced by squares and additional constraints (e.g. on the length of specific 
edges) are generally considered. Since several VLSI formulations, related with 
the compaction problem, are proved to be NP-hard [11], compacting orthogonal 
representations is widely believed to be an NP-hard problem too. 

From the practical point of view, we have both exhaustive approaches finding 
optimal solutions and heuristic approaches producing suboptimal ones. To the 
first class belongs the ILP formulation presented in [10], relying on branch-and- 
cut or branch-and-bound techniques to find an optimal solution. To the second 
class belong both the “rectangular refinement” approach proposed in [14], and 
the “turn regularization” approach proposed in [2], based on the fact that the 
compaction problem is tractable when all faces of the orthogonal representation 
are rectangular or “turn-regular” , respectively. 

In this paper, by means of a reduction from the SAT problem, we prove that 
compacting an orthogonal representation of a connected graph, while minimiz- 
ing an aesthetic measure between area, total edge length, and maximum edge 
length is an NP-complete problem. To accomplish this, after formally defining 
the three problems in Section 2, we construct in Section 3 an orthogonal rep- 
resentation corresponding to a formula 0 of the SAT problem and admitting 
a number of orthogonal grid drawings with minimum area, that are in one-to- 
one correspondence with the truth assignments satisfying 0. The previous result 
is easily extended to the problems of compacting an orthogonal representation 
while minimizing the total edge length and the maximum edge length. 

Carefully exploiting the same constructions presented in Section 3 it can be 
proved (see [13]) that the three problems are not in PTAS, i.e., that they do not 
allow a polynomial-time approximation scheme. 

2 Preliminaries 

We assume familiarity with basic graph theoretic concepts and graph draw- 
ing terminology (see, e.g., [7] and [3], respectively) concerning planarity, planar 
graphs, and planar representations. 
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In a planar orthogonal drawing T of a graph G, vertices are placed on distinct 
points of the plane and edges are alternating sequences of horizontal and vertical 
segments, non intersecting except at edges common endpoints. 

We consider, without loss of generality, only orthogonal drawings with no 
bends, since each bend can be replaced by a dummy vertex of degree two. 

Let / be a face of an embedded 4-planar graph G, and F an orthogonal draw- 
ing of G. We associate with each pair of consecutive edges (possibly coinciding) 
of /, a value a, such that a • tt/2 is the angle formed by the two consecutive 
edges into /. An orthogonal representation of G describes an equivalence class of 
planar orthogonal drawings of G with “similar shape” , that is with the same a 
values associated with the angles around vertices (a more formal definition can 
be found in [14,2,3]). 

An orthogonal grid drawing F of a. graph G (without bends) is an orthogonal 
drawing such that vertex coordinates have integer values; its area is the area of 
the smaller rectangle including it; its total edge length is the sum of the lengths 
of its edges; and its maximum edge length is the maximum value of all its edge 
lengths. 

This paper is concerned with the complexity of producing an orthogonal grid 
drawing F starting from its orthogonal representation H while minimizing the 
area of the drawing, the total edge length, or the maximum edge length. The 
three minimization criteria are considered to have roughly the same aesthetic 
effect: that of reducing the size of the drawing (or of part of it) and so improve 
its readability. However, conflicts between the three requirements (see [13]) imply 
that they constitute different, although closely related, optimization problems. 

Following an usual technique (see, e.g., [6,12]), rather than addressing di- 
rectly the three optimization problems we will consider their corresponding deci- 
sion versions according to which the Orthogonal Area Compaction (Edge Length 
Compaction, Maximum Edge Length Compaction, respectively) problem con- 
sists in considering an orthogonal representation if of a graph G and a constant 
AT, and deciding whether integer coordinates can be assigned to the vertices of 
G so that the area (total edge length, maximum edge length, respectively) of 
the drawing is less or equal than K. 

We will show in the next section that the three problems above are NP-hard 
and are in NP. This is summarized in the following theorem: 

Theorem 1. The OAC, OTELC, and OMELC problems are NP-complete. 

3 NP-Completeness of the OAC, OTELC, and OMELC 
Problems 

Since is quite easy to produce three nondeterministic Turing machines that de- 
cide the three problems in polynomial time (see [13]), we take for granted that the 
problems are in NP, and tackle the more challenging NP-hardness proof. First, 
by means of a reduction from the SAT problem, we prove the NP-hardness of 
the Orthogonal Area Compaction problem, and then we extend this result to 
the remaining two. 
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We recall that the SAT problem consists in deciding whether truth values 
can be assigned to a set of variables in such a way that each clause of the input 
set contains at least one true literal. Given a formula 0 in conjunctive normal 
form with variables , . . . , Xn and clauses Ci , . . . , Cm, we produce an orthogonal 
representation H{(j)) and a constant K{(j)) such that an orthogonal grid drawing 
of area less or equal than AT(0) exists if and only if (j) is satisfiable. 

Please, notice that in the SAT definition all the variables in the same clause 
can be assumed to be different, i.e., the version of SAT in which each clause con- 
tains appearances of distinct variables is also NP-complete (this can be trivially 
proved by introducing a linear number of dummy variables and further clauses). 

In order to construct the orthogonal representation H{(j)) we first build a 
clause-gadget for each clause Ci. The clause-gadget is composed by n cham- 
bers, one for each variable, whether the variable actually occurs in the clause 
Ci or not. The darkened area of Fig. 1 represents the clause-gadget for the first 
clause of a formula (j) with four boolean variables (edge lengths are not mean- 
ingful, since the figure is only meant to describe an orthogonal representation). 
In what follows we will call weldings the vertices shared by two adjacent cham- 
bers of the same clause-gadget (black vertices of Fig. 1). All the clause-gadgets 
of formula (j) are placed one upon the other, so that each chamber shares its 
bottom (top) vertices with the corresponding chamber of the clause-gadget be- 
low (above). Furthermore, vertical paths are introduced to separate weldings of 
different clause-gadgets (again, see Fig. 1). Informally speaking, each column 
of chambers corresponds to a variable of formula 0, and if an orthogonal grid 
drawing features a column of chambers in a “lower position” with respect to the 
weldings (like the first and third columns of Fig. 1), then the corresponding vari- 
able is assigned a false value, while if the orthogonal grid drawing has a column 
in the “higher position” (like the second and fourth columns of Fig. 1), then the 
corresponding variable is assigned a true value. 

Two types of subgraphs are inserted inside the chambers: obstacles (attached 
inside each chamber), and pathways (traversing all the chambers of the same 
clause-gadget). Observe in Fig. 1 how the obstacles of different chambers dif- 
fer, depending on whether the literal they correspond to either appears in the 
clause with a positive value, or appears in the clause with a negative value, or 
doesn’t appear at all in the clause. Pathways, instead, have the same shape for 
every clause-gadget, and are composed by a succession of 2n — 1 A-shaped struc- 
tures linked together. They attach to the first and to the last chambers of the 
clause-gadget. Roughly speaking, the role of obstacles and pathways is that of 
preventing the clause-gadgets from assuming a shape corresponding to a truth 
assignment of the boolean variables that doesn’t satisfy at least one literal of 
each clause. In fact, it can be shown that a chamber corresponding to a literal I 
may contain less than two A-shaped structures of the pathway only if either it 
assumes the “lower position” and I is negative or it assumes the “higher position” 
and I is positive. 

Finally, in order to obtain the orthogonal representation H{(f)), an external 
boundary and a belt are added to the construction. The external boundary has a 
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Fig. 1. The orthogonal representation if(0) corresponding to the formula 0 = {x2 V 
X4) A(xi Vx2 Vx3 VX4) A(x 3) A(xi Vx2 Vxa)- The orthogonal grid drawing shown in the 
figure has minimum area and corresponds to the truth assignment: xi = false^X2 = 
true^ xz = false, X4 = true. 



top and bottom sides of 9n H- 3 vertices and a right side of 9m -h 8 vertices. The 
belt is a path inserted between the boundary and the core of the construction 
and is composed by 2 + 24(n — 1) vertices, so that its turn sequence is 
where an r (/) label represents a right (left) turn. The external boundary, the 
belt, and the core of the construction are attached together as shown in Fig. 1. 
Because of the belt, in any orthogonal grid drawing of i?(0) with minimum area 
each column can assume only one of the two positions that we informally called 
“higher” and “lower”. 

The instance {H, K) of the OAC problem is completely defined by assigning 
the value of K{(j)) = (9n + 2) x (9m + 7). 

The NP-hardness of the OAC problem is easily demonstrated as the following 
lemma is proved [13]: 



Lemma 1. An orthogonal grid drawing of area at most K{(j)) of the orthogonal 
representation H (0) exists if and only if formula 0 is satisfiable. 
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In order to prove the NP-hardness of the OMELC and OTELC problems, 
we produce two orthogonal representations, containing the described H{(f)) as a 
subgraph, and such that each orthogonal grid drawing of them that minimizes 
the measure of interest also minimizes the area of the contained 

Namely, for the OMELC problem, first we add to appropriate sub- 
graphs (see [13]) in order to assure an equal number of vertices along the top 
and right sides of it, and then we attach two edges running along these two sides. 
For the OTELC problem we add enough of such vertical and horizontal edges 
so to force the minimization of the total edge length to imply the minimization 
of the newly added edges, i.e., of the area of H{(j)). 
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Abstract. Aicholzer et al. recently presented a new geometric construct 
called the straight skeleton of a simple polygon and gave several combina- 
torial bounds for it. Independently, the current authors defined in com- 
panion papers a distance function based on the same offsetting function 
for convex polygons. In particular, we explored the nearest- and furthest- 
neighbor Voronoi diagrams of this function and presented algorithms for 
constructing them. In this paper we give solutions to some constrained 
annulus placement problems for offset polygons. The goal is to find the 
smallest annulus region of a polygon containing a set of points. We fix 
the inner (resp., outer) polygon of the annulus and minimize the annulus 
region by minimizing the outer offset (resp., maximizing the inner offset). 
We also solve a a special case of the first problem: finding the smallest 
translated offset of a polygon containing an entire point set. We extend 
our results for the standard polygon scaling function as well. 



1 Introduction 

Computing optimal placements of annulus regions is a fundamental aspect of 
quality control in manufacturing. For example, the width of the thinnest circular 
annulus containing a set of points is the ANSI [F, pp. 40-42] and ISO measures 
for testing roundness. The usual goal is to find a placement of the annulus 
that contains a given set or subset of points. Optimality of the placement can 
be measured either by minimizing the size of the annulus region necessary to 
contain all (or a certain number) of the points, or by maximizing the number of 
points contained in a fixed-size annulus. 

* Work on this paper by the first author has been supported by the U.S. ARO under 
Grant DAAH04-96-1-0013. Work by the second author has been supported by the 
NSERC of Canada under grant OGP0183877. Work by the third author has been 
supported by the NSF under Grant CCR-93-1714. 




63 



One set of problems studied recently in [BBDG] involves the optimal place- 
ment of polygonal annulus regions. It was noted that the polygonal annulus can 
be defined as the difference region either between two scaled copies of some poly- 
gon P or between two offset copies of P. This work gave solutions for the offset 
version of the problems for both definitions of optimality given above. 

In this paper we continue the investigation of the polygon offset operation, 
especially as it defines a distance function and the related Voronoi diagram 
(see [BDG]). We prove some new properties of this offset-polygon distance func- 
tion. We then explore a new set of convex-polygon annulus placement problems 
where one of the two annulus boundaries (inner or outer) is fixed. This is an 
important aspect of quality control. As an example, if a manufactured object is 
to fit inside a sleeve, then the outer annulus must be fixed. On the other hand, if 
a part must fit over a peg, then the inner annulus is crucial. For circular annuli 
it was shown [BBBR] that when either the inner disk or the outer disk is of fixed 
size, the placement problem can be solved more efficiently. We solve problems 
for polygonal annulus regions defined for both the polygon offsetting function 
and the normal convex polygon scaling distance function. In particular, we give 
algorithms for the following problems: 

Problem 1. Given a convex polygon P and a set of points 5, find a placement 
T that maximizes an inner polygon, such that all points in S lie in the annulus 
region between P and a smaller offset (resp., scaled) copy of P. 

Problem 2. Given a convex polygon P and a set of points S, find a placement 
r that minimizes an outer polygon, such that all points in S lie in the annulus 
region between P and a larger offset (resp., scaled) copy of P. 

Note that in both cases we are looking for placements of the polygon con- 
taining all points. The following problem is a special case of Problem 2: 

Problem 3. Given a convex polygon P and a set of points 5, find the smallest 
offset (resp., scaled) copy of P containing all the points in S. 

A substep of several approaches to the above problems is the following: 

Problem 4- Given a convex polygon P and n translations, find the intersection 
of the n translated copies of P. 

We first present some approaches to Problems 3 and 4, and then provide 
subquadratic-time algorithms for Problems 1 and 2. Our algorithm for Problem 1 
requires 0((m -f- n)log(m -h n)) time (for scaling) and 0(n(logn -h log^m) -f 
m(logn-flogm)) time (for offsetting), where n is the number of points in S and 
m is the complexity of P. Our algorithm for Problem 2 requires 0(nmlog(nm)) 
time (for both scaling and offsetting) . 

1.1 The Offset Operation 

We now briefiy discuss the polygon-offset operation. This operation was studied 
by Aicholzer et al. [A A, A A AG] in the context of a novel straight skeleton of 
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a polygon. Barequet et al. [BDG] also studied the polygon-offset operation in 
a different context, that of a new distance function and the related Voronoi 
diagram. They give efficient algorithms for computing compact representations 
of the nearest- and furthest-neighbor diagrams. Polygon offsets were also used 
in the solution to various annulus placement problems [BBDG]. In this paper 
we adopt the terminology of [BBDG] and refer the reader to Section 1.3 of that 
paper for a more precise definition of the offset operation. Likewise, for a formal 
definition of the corresponding distance function, the reader is referred to [BDG]. 
We give here only a brief definition and description. 

The scaled polygons of Problems 1 and 2 correspond to the convex distance 
functions, which are extensions of the notion of scaling circles (in the Euclidean 
case) to convex polygons. There have been several papers (e.g., [CD,KW,MKS]) 
which explore the Voronoi diagram based on these distance functions. 

By contrast, the outer e-offset of a convex polygon P is obtained by translat- 
ing each edge e G P by e in a direction orthogonal to e and by extending it until 
it meets the translations of the edges neighboring to e. The edge e' is trimmed 
by the lines parallel and at distance e (outside of P) of the neighboring edges of 
e. The inner offset is defined in a similar way: For each edge e G P we construct 
an edge e" on a line parallel to e and trimmed by the offsets of the neighbor- 
ing edges. The edge e" may “disappear” for a sufficiently large e. This happens 
when the neighboring offset lines meet “before” they intersect with the line that 
contains e". In fact, the offset operation moves the polygon vertices along the 
medial axis of the polygon, so that an edge “disappears” when two polygon ver- 
tices meet on a medial-axis vertex. We adopt the terminology of [BBDG] and 
denote by Ip^s (resp.. Op, <5) the inner (resp., outer) offset version of P by S. 

In many applications, e.g., manufacturing processes, defining distance in 
terms of an offset from a polygon is more natural than scaling. This is because 
the relative error of the production tool (e.g., a milling head) is independent of 
the location of the produced feature relative to some artificial reference point 
(the origin). Thus it is more likely to allow (and expect) local errors bounded 
by some tolerance, rather than scaled errors relative to some (arbitrary) center. 



2 Preliminary Observations 

We first define the meaning of a placement of a polygon. Throughout the paper 
we assume that each polygon has a fixed reference point. For scaled polygons, we 
use the center of scaling which is assumed to be contained by the polygon. For 
offset-polygons, the natural reference is the offsetting center, which is the point 
to which the inner polygon collapses when the polygon is offset inward. (This 
point is the center of the medial axis of the polygon [BDG].) In the degenerate 
case the offset center is a segment, so we arbitrarily select a point of it as the 
center, say, the median of the segment. By translating a copy of the polygon P 
to some point g, we mean the translation of P that maps its center (reference 
point) to q. Similarly, when we speak of the reflection of P, we mean the rotation 
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of P by 7 T around its center point. The translation of a reflection of P to a point 
q translates the polygon so that the center of the reflected copy is mapped to q. 

Some of the following observations are analogous to well-known facts with 
respect to other distance functions. It is not obvious, however, that the properties 
of Euclidean distance hold for the offset-polygon distance function. The offset- 
polygon distance function is not a metric [BDG]. In fact, like the more common 
Minkowski functions (scaled polygon distance) it is not even symmetric. It is 
proven in [BDG] that the offset-polygon distance function does not satisfy the 
triangle inequality, and in fact, for collinear points it satisfies a reverse inequality. 

Our algorithms use Voronoi diagrams based on the scaled (Minkowski) or 
offset distance functions. In both cases the bisector of two points p, q is (in the 
non-degenerate case) the polyline that contains all points x for witch d(p, x) — 
d{q, x). We could also define it symmetrically as the set of points x s.t. d{x,p) — 
d{x,q)^ which identifies with the first definition when we reflect the underlying 
polygon. Since neither distance function is symmetric, these two definitions result 
in different bisectors. However all of the following observations and lemmas hold 
regardless of which definition is used. 

Observation 1 The bisector between two points p and q has a segment s (the 
median of the bisector, analogous to the midpoint of pq in the Euclidean dis- 
tance function), such that the distance from p and q to x, for any point x on 
s, is minimum of all points on the bisector. In both directions along the bisector 
from s, distances from p and q to points on the bisector are monotonically non- 
decreasing. In the case that the defining polygon P does not have parallel edges, 
the median is always a single point x. 

This can be seen by examining the pair of smallest offset (or scaled) copies of 
P placed at points p and q that touch. Since the polygons are convex, the inter- 
section is a segment or a single point. As the polygons grow outward from this 
point, the median (point or segment) is completely contained in the intersection 
of any larger copies. We use the same idea to illustrate the following: 

Observation 2 Given a point p, a line L, and a point q ^ L, there exists a 
direction along L from q s.t. d[x,p) > d{q,p) for all points x in that direction. 

Note that this is true for d{p,x) > d{p,q) as well as for d{x,p) > d{q,p), but 
the directions might be different! 

We now bound the region in which the fixed-size polygon (i.e., its center) can 
be placed. For Problem 1 we ask where the possible placements of the fixed-size 
outer polygon that contain all the points are, and for Problem 2 we ask where 
the possible placements of the fixed-size inner polygon, that do not contain any 
of the points, are. These are the sets of “feasible placements.” In the sequel P is 
a polygon, Pr is its reflection, and ^(Pr) is a translation of Pr to the point q. 

Observation 3 A translation r of P contains q if and only if ^(Pr) contains 
the point to which r translates (the center of) P. 
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Generalizing this for a set 5 of n points we have: 

Observation 4 A translation r of a polygon P contains all the points of a set 
S if and only if r translates P to a point in the intersection of the n copies of 
Pr translated to the points of S. 

Based on the last observation we define a feasible region for placements of the 
annulus region in Problem 1, which is the intersection of n refiected copies of P 
translated to each point in S. This feasible region, according to Observation 4, 
contains all possible placements where the fixed outer polygon contains all the 
points in 5. The goal then becomes to find the largest inner polygon that can 
be placed inside this region without containing any point of S. If the feasible 
region of the outer polygon is already empty, then there is no solution at all. 

A solution to Problem 4 thus provides us with the feasible region. There is 
an analogous idea for Problem 2, where we are interested in finding a placement 
of the inner polygon such that it does not contain any point of S. 

Observation 5 A translation r of a polygon P contains no points of a set S of n 
points if and only if r translates P into the region determined by the complement 
of the union of n reflected copies of P translated to the points of S. 

Based on this observation, we define a different feasible region for placements 
of the annulus region in Problem 2. The feasible region is given by the comple- 
ment of the union of n refiected copies of P translated to each of the points in 5, 
plus the boundary edges of the region. This feasible region consists of all possible 
placements where the fixed inner polygon does not properly contain any of the 
points in 5. The goal then becomes to find the smallest outer polygon (scaled 
or offset) that can be placed inside this region while containing all points in S. 



3 Intersecting Copies of a Convex Polygon 

In this section we describe several alternative approaches to solving Problem 4. 
It is shown in [BDP] how the prune-and-search technique of Kirkpatrick and 
Snoeyink [KSn] can be used for finding the intersection points of two translated 
copies of a convex polygon. We now describe several ways to compute all the 
vertices of the polygon that is the intersection of n translations of a polygon with 
m vertices. These algorithms use well-known techniques but are included here 
for completeness. There are several other competing approaches as well, which 
we do not outline here. The resulting running times of the following approaches 
are 0(nm), 0{nlogh + m) (where h is the number of hull points), or 0{n{\ogn + 
logm) -h m). The third approach is always inferior to the second approach. 



3.1 Brute Force 

One “brute force” approach is to start with two copies of the polygon, and 
compute their intersection using any of several algorithms for intersecting convex 
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polygons in 0{m) time. Each of the remaining n- 2 polygons can then iteratively 
be intersected with the polygon resulting from the previous step. After each step 
the resulting intersection is still a convex polygon with at most m edges, each 
parallel to an edge of the original polygon. Thus each step requires 0{m) time 
for a total of 0{nm). This brute force approach is not only simple, but is linear 
in n, and so is a good approach when m is small. 

A second brute force approach relies on a simple observation. Each edge 
in the output polygon P* is determined by a single translated polygon from 
the input set — in particular, by that polygon which is extremal in the direction 
orthogonal to and toward the center of the polygon. The algorithm iterates 
through the m edges of P, and for each edge ei determines in 0(n) time which 
of the n input polygons might contribute edge ei to the output. We iteratively 
construct P*, adding an edge at a time and eliminating edges that are cut off. 
Note that the addition of a new edge may eliminate more than one edge. The cost 
of adding edge is 0(1 + c^), where Ci is the number of earlier edges removed by 
6i. Since the total number of edges is at most m, the sum of the c^’s is also m. 
This approach reverses the roles of the inner and outer loop from the previous 
algorithms. The running time of this approach is also 0{nm). 

3.2 Using the Convex Hull 

We can modify both approaches by using of the following lemma and its corollary. 

Lemma 1. A convex polygon contains all points in a set S if and only if it 
contains all vertices of the convex hull of S. 

Corollary 1. The intersection of n translated copies of a polygon P placed at 
each of n points from a set S is the same as the intersection of h translated 
copies of P placed at the h vertices of the convex hull of S. 

Thus we can eliminate non-hull points in a preprocessing step. Depending 
on the relationship between h and m, the speedup in the later computation may 
pay for the cost of computing the convex hull. We compute the convex hull in 
0{nlogh) time [KSe]. Applying the first brute force approach only to the h hull 
points, we get a total running time of 0{n log h -h hm) which is an improvement 
if m = uj{\ogh) and h — o{n). The second brute force approach can be improved 
even further. Given the convex hull of a set of points, we can compute extremal 
points in all m directions in rotational order in 0(m -h h) time by using the 
“rotating-calipers” method [To], or in O(mlogh) time by performing a binary 
search for each of m directions. The output polygon P* is still constructed an 
edge at a time. As in the previous method, the overall number of eliminated 
edges is m. The overall running time is thus 0{n log h -h m) oi 0{{n + m) log h). 



3.3 A Furthest-Neighbor Voronoi Diagram Approach 

A final approach makes use of a Voronoi diagram. First we compute a compact 
representation of the furthest-neighbor Voronoi diagram of the n points. For 




68 



convex distance this is done by the algorithm of [MKS] in 0(n(logn+logm)+m) 
time. For convex-offset distance this is done by [BDG] in 0(n(logn+log^ m)+m) 
time.^ Now, we follow the first (out of three) step of Lemma 2.1 of [DGR, 
p. 124], which constructs the intersection of n congruent circles in 0{n) time. 
Specifically, in [DGR] the authors find the portion of the intersection in each 
cell of the furthest-neighbor Voronoi diagram by a simple walking method. We 
observe that they amortize the number of jumps between cells of the diagram, 
and obtain (for the circles case) an 0{n) time bound due to the complexity of 
the diagram. For both our distance functions we do this in 0(n log m) time: 
From the compact representation of the diagram we explicitly compute only the 
portions that belong to the intersection. This happens 0{n) times, for each we 
spend O(logm) time. The size of the accumulated output is 0(n log m). In total 
we have 0(n(log n -h log m) 4- m)- and 0(n(log n + log^ m) -h m)-time algorithms 
for the scaling and offsetting distance functions, respectively. 

Since we are interested in the intersection of copies of the original polygon, 
for which the unit scale or offset identify, it does not matter which distance 
function is used. So we may prefer to choose the respective Voronoi diagram 
of the scaling function, which provides a slightly faster algorithm. The Voronoi 
diagram approach may also be modified with the precomputation of the convex 
hull, however the result is asymptotically slower than the second brute force 
approach when the convex hull is used. 

4 Smallest Enclosing Polygon 

In this section we solve Problem 3 which is a special case of Problem 2, in which 
the inner “radius” of the annulus is 0: we seek the translation of a minimum 
offset or scaled version of a polygon P, so as to cover a set 5 of n points. 



4.1 Shrinking the Feasible Region 

Our first approach makes use of the results of the previous section. 

1. Compute an offset (or scaled) version of P (denoted as P* = Op^s^ for some 
(5 > 0) large enough so that there exists a placement of P* containing S. 

2. Compute the intersection J of n reflected copies of P* translated to the 
points of 5. 

3. Shrink J (by reducing 5) until it becomes a single point. 

The first step is straightforward. In 0(n) time we find B, the axis-parallel 
bounding-box of 5. We then scan around P in 0(m) time to find any sized 
rectangle enclosed in P, and grow P (either by scaling or offsetting) enough 
so that its interior rectangle encloses B. Denote the resulting polygon (J scale 
or offset of P) as P*. By Observation 4, it is guaranteed that the region J 

^ A bound of 0(n(log n-f-logm)-f m) was erroneously claimed in [BDG]. The corrected 
analysis is found in the full version of that paper. 
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(computed in the second step), that contains all placements of P* that fully 
cover 5, is non-empty. Furthermore, the region J is convex, with edges parallel 
to the original edges of P*, and thus the complexity of J is 0{m). 

The crucial observation is that by reducing 6 (in the third step), J shrinks 
too until it becomes a single point defining the placement and size of the smallest 
copy of P that fully contains 5. This yields the algorithm: in the second step we 
compute the intersection J of the n refiections of P* (the region of all placements 
of P* fully covering S'), and in the third step we decrease 6 until J shrinks into 
a point. We use the medial axis center (or the equivalent scaling center) of J to 
determine the point (or segment) to which the polygon shrinks. 

A solution to the second step was described in Section 3. The intersection 
polygon J is computed in 0{nm) or 0{n\ogh -h m) time. 

The third step depends on whether the polygon is offset or scaled. For the 
offset operation, the point to which J shrinks is the center of its medial axis. 
(This is easily seen when we model the effect of reducing (5 on J: the edges of 
J are portions of edges of P translated to the points of S'.) This point can be 
found in 0(m) time by using the method of Aggarwal et al. [AGSS]. 

For scaled polygons we need to slightly modify the method of [AGSS]. The 
method observes that the medial axis of a convex polygon is actually the lower 
envelope of 3-D planes cutting through the edges of P at fixed angles to the 
plane z = 0 that contains P. For the scaling operation, all we need to do is to 
adjust the slope of every plane. It is a function of three points: the origin and 
the two endpoints of the respective edge. Namely, the slopes are no more fixed 
but are proportional to the “speeds” by which the edges move. We keep track of 
which original copy of P each edge in J belongs to, so we can compute all these 
angles and solve the problem again in 0{m) time. 

The time complexity of this algorithm is thus dominated by the second step: 

Theorem 6. The smallest enclosing (scaled or offset) polygon problem can be 
solved in either 0{nm) or 0{n\ogh -f m) time. 

4.2 A Randomized Incremental Approach 

Problem 3 can also be solved by a randomized incremental approach, which is 
a modified version of that described in [BROS, §4.7] for finding the smallest 
enclosing circle. We start with finding the smallest enclosing polygon P 3 of three 
points qi,q 2 ,Q 3 ^ S. We add point qi at the ith step (for 4 < i < n). If is 
contained in P^-i, then Pi = Pi-\. If not, we compute Pi with the knowledge 
that point qi must be one of the constraining points (e.g., qi lies on the boundary 
of Pi). The reader is referred to [BROS] for details. The analysis of the expected 
running time is the same as for circles except that computing the smallest (scaled 
or offset) polygon containing 3 points requires O(logm) time (for scaling) [RSn] 
or O(log^m) time (for offsetting) [BDG], rather than 0(1) time. 

Theorem 7. The smallest enclosing polygon problem can be solved in expected 
time 0 (n log m H- m) (for scaling) or 0 (n log^ m + m) (for offsetting). 
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5 Minimizing the Annulus for a Fixed Outer Polygon 

We now address the problem of minimizing an annulus region by fixing the outer 
polygon and maximizing the inner polygon (Problem 1). 

Theorem 8. Given a convex feasible region of possible translations of a polygon 
P, there exists a largest (scaled or offset) empty polygon (containing no points 
in S ) that is centered on one of the following points: 

1. On a vertex of the nearest-neighbor Voronoi diagram of S; 

2. On an intersection of an edge of this Voronoi diagram with the feasible region; 

3. At a vertex on the boundary of the feasible region; specifically a vertex which 
represents the intersection of two reflected copies of P (as opposed to an 
arbitrary vertex of one reflected copy of P). 

Corollary 2. The optimal placement of the annulus region, when its outer bound- 
ary is fixed, has at least 3 contact points between the set S and the inner or outer 
boundary of the annulus region, at least one of which is in contact with the inner 
boundary (the maximized inner polygon). 

The algorithm for solving Problem 1 is based on Theorem 8. First we con- 
struct the feasible region J by intersecting n convex polygons of complexity m 
in 0{nm) or 0(n log /i + m) time (see Section 3). Next we construct the nearest- 
and furthest-neighbor Voronoi diagrams of S w.r.t. P and the appropriate dis- 
tance function. Compact representations for both diagrams can be computed in 
0(n(log n -h log m) -f m) time (scaling) or in 0(n(log n + log^ m) + m) time (off- 
setting). Finally, we check at most n Voronoi vertices, n intersections between 
Voronoi edges and J, and m vertices of J, to find which allows the maximal 
polygon. For a Voronoi vertex we test containment in J in O(logm) time. For a 
Voronoi edge we find intersections with J in O(logm) time. To find the maximal 
inner polygon, we need to know the distance to the nearest neighbor in S. For 
Voronoi vertices and edges, this is known. For vertices of J we do point location in 
the compact Voronoi diagram in 0(logn+logm) time, and computing the actual 
distance requires additional O(logm) time. The total running time for checking 
the 0(n + m) possible locations is therefore 0(n log m -f m(log n -f log m)) time. 

Theorem 9. The minimum polygon annulus with fixed outer polygon can be 
computed in 0{{rn^-n)\og{m-\-n)) time (for scaling) orinO{n{\ogn-\-\og^m)-\- 
m(logm + logn)) time (for offsetting). 

6 Minimizing the Annulus for a Fixed Inner Polygon 

In this section we address the problem of minimizing an annulus region by fixing 
the inner polygon and minimizing the outer polygon (Problem 2). 

Lemma 2. The feasible region is the complement of the interior of the union of 
the n reflected copies of P placed at points of S. 
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Proof: Follows from Observation 5. □ 

Note that this feasible region may have two different types of vertices. One 
type (denoted a P-vertex) is simply a vertex of a reflected copy of the polygon. 
The second vertex type (denoted an I-vertex) is an intersection of two copies of 
the reflected polygon. The following observation follows from definition. 

Observation 10 An I-vertex is equidistant from two points in S according to 
the polygon distance function. 

Note that if we move counterclockwise around the feasible region, every tra- 
versed P-vertex is a left turn whereas I- vertices are right turns. In particular, 
from the point of view of the feasible region, the angle around a P-vertex that 
belongs to the feasible region is greater than tt. This yields the next observation: 

Observation 11 If e is the edge of a feasible region adjacent to a P-vertex, and 
L is the line containing an edge e, then it is possible to move some distance e > 0 
in both directions along L from the P-vertex without leaving the feasible region. 

Let U be the union of n reflected copies of P placed at the points of S. 
By Observation 10, placing P at an I-vertex of the boundary of U results in P 
having at least two points of S on its boundary. Hence an I-vertex of U is on an 
edge or on a vertex of the nearest-neighbor Voronoi diagram of 5. Furthermore, 
since each edge of this diagram corresponds to two points in 5, and the two 
copies of the reflected polygon associated with those points intersect in at most 
two points [BBDG, Theorem 1], each Voronoi edge can be associated with at 
most two I- vertices. Since the Voronoi diagram (in its compact representation) 
has 0{n) edges (each having at most m segments) and 0{n) vertices, U can have 
at most 0{n) I-vertices. (n polygons can certainly intersect in 0(n^) points, but 
only 0{n) of these points may be I-vertices of the boundary of U.) There may 
also be at most 0{nm) P-vertices. Therefore, the complexity of the boundary of 
U is 0{nm). The polygon U can be computed in O (nm (log n -h logm)) time by 
using a divide- and- conquer approach. We summarize with the following: 

Theorem 12. The union U of n reflected copies of P has 0(n) I-vertices and 
0{nm) P-vertices, and the complexity of its boundary is 0{nm). It can be com- 
puted in 0{nm{\ogn -f logm)) time. 

Given an edge e of the furthest-neighbor Voronoi diagram of 5, we refer to 
the two points of 5 equidistant from e as the generators of e. 

Theorem 13. The center of the smallest enclosing polygon is in the feasible 
region on one of the following: 

1. A vertex of the furthest-neighbor Voronoi diagram; 

2. A point on an edge of the furthest-neighbor Voronoi diagram provided it is 
the median of the bisector of its generators (see Observation 1); 

3. The intersection point of an edge e of the furthest-neighbor Voronoi diagram 
and the boundary of the feasible region that is closest to the median of the 
bisector of the generators of e; or 

4 . An I-vertex of the feasible region (see Observation 2). 
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Corollary 3. The optimal placement with fixed inner polygon and a minimum 
outer polygon has either: 

1. Two points on the outer polygon on a diameter; 

2. Three points on the outer polygon; 

3. Two points on the outer polygon and one on the inner polygon; or 

4- One point on the outer polygon and two on the inner polygon. 

Theorem 13 implies an approach to computing the minimum polygon an- 
nulus. First, compute all the possible locations for the center. Second, for each 
location, compute the size of the annulus. Output the smallest of these annuli. 

First, we need a way of deciding whether a point x is in the feasible region. 
To do this we compute compact representations of the furthest-neighbor Voronoi 
diagram of S based on a reflection of a convex polygon P and the appropriate 
distance function. This requires 0(n(logn + logm) + m) time (for scaling) or 
0(n(logn + log^m) + m) time (for offsetting). We preprocess the diagram for 
planar point location. Once we know the closest point of 5 to x, we can determine 
whether it is in the feasible region or not in 0(logn -f logm) time. 

There are 0(n) vertices in the furthest-neighbor Voronoi diagram. Contain- 
ment in the feasible region can be verified in 0(n(logn + logm)) time: There 
are 0{n) medians in the diagram, each can be verified in 0(logn -f logm) time. 
There are 0(n) vertices on the boundary of the feasible region (see Theorem 12). 
All of these vertices can be verifled in 0(n(logn + logm)) time. To compute the 
intersection point of an edge e of the furthest-neighbor Voronoi diagram and 
the boundary of the feasible region that is closest to the median of the bisector 
of the generators of e, we note that each edge of the diagram is a polygonal 
chain of at most m segments. If the median is on e and is feasible, then no other 
candidate on e is smaller (distance- wise) . Therefore, we need only consider the 
edges where the median is not feasible. In this case, we direct the segments of 
the edge toward the median. For each segment we need only the first intersection 
with the feasible region. This can be viewed as a ray shooting query. For each 
directed segment ^ we seek its intersection point with U that is closest to s. 
Preprocessing U for ray shooting queries is too costly. Instead, we perform two 
plane sweeps to compute the intersections between U and the directed segments, 
one for the segments directed to the left and one for the segments directed to 
the right. After the first intersection for a given segment is found, we remove it 
from the event queue. Therefore, each segment is processed at most twice, once 
when it is placed in the queue and once for its first intersection. Since there are 
0{nm) segments and the boundary of U has 0{nm) segments, each of the two 
sweeps takes 0(nm(logn + logm)) time. All of the candidates are generated and 
verifled in 0(nm(logn + logm)) time. We conclude with the following: 

Theorem 14. The minimum polygon annulus with fixed inner polygon can he 
computed in 0(nm (log n -f logm)) time. 
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Abstract. A new measure, the accommodating function^ for the quality 
of on-line algorithms is presented. The accommodating function, which 
is a generalization of both the competitive ratio and the accommodating 
ratio, measures the quality of an on-line algorithm as a function of the 
resources that would be sufficient for some algorithm to fully grant all 
requests. More precisely, if we have some amount of resources n, the 
function value at a is the usual ratio (still on some fixed amount of 
resources n), except that input sequences are restricted to those where 
all requests could have been fully granted by some algorithm if it had 
had the amount of resources an. The accommodating functions for two 
specific on-line problems axe investigated: a variant of bin-packing in 
which the goal is to maximize the number of objects put in n bins and 
the seat reservation problem. 



1 Introduction 

The competitive ratio [11,19,15], as a measure for the quality of on-line algo- 
rithms, has been criticized for giving bounds that are unrealistically pessimistic 
[1,2,12,14,16], and for not being able to distinguish between algorithms with 
very different behavior in practical applications [2, 12, 16, 18]. Though this criti- 
cism also applies to standard worst-case analysis, it is often more disturbing in 
the on-line scenario [12]. 

The basic problem is that the adversary is too powerful compared with the 
on-line algorithm. For instance, it would often be more interesting to compare 
an on-line algorithm to other on-line alternatives than to an all-powerful off-line 
algorithm. A number of papers have addressed this problem [10,2,6,13,14,18, 
11,21,20,16] by making the on-line algorithm more powerful, by providing the 
on-line algorithm with more information, or by restricting input sequences. 

In this paper, we move in the direction of restricting input sequences. How- 
ever, instead of a “fixed” restriction, we consider a function of the restriction, 
the accommodating function. Informally, in on-line problems, where requests are 

* Supported in part by the esprit Long Term Research Programme of the EU under 
project number 20244 (alcom-it) and in part by SNF, Denmark. 

** Dept, of Mathematics and Computer Science, University of Southern Denmark, Main 
campus: Odense University, Campusvej 55, DK-5230 Odense M, Denmark. 
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made for parts of some resource, we measure the quality of an on-line algorithm 
as a function of the resources that would be sufficient for an optimal off-line 
algorithm to fully grant all requests. More precisely, if we have some amount of 
resources n, the function value at a is the usual ratio (still on some fixed amount 
of resources n), except that input sequences are restricted to those where all re- 
quests could have been fully granted by an optimal off-line algorithm if it had 
had the amount of resources an. 

In the limit, as a tends towards infinity, there is no restriction on the input 
sequence, so this is the competitive ratio. When all requests can be fully granted 
by an optimal off-line algorithm (without using extra resources), the function 
value is the accommodating ratio [3]. Consequently, the accommodating function 
is a true generalization of the competitive as well as the accommodating ratio. 

In addition to giving rise to new interesting algorithmic and analytical prob- 
lems, which we have only begun investigating, this function, compared to just 
one ratio, contains more information about the on-line algorithms. For some 
problems, this information gives a more realistic impression of the algorithm 
than the competitive ratio does. Additionally, this information can be exploited 
in new ways. The shape of the function, for instance, can be used to warn against 
critical scenarios, where the performance of the on-line algorithm compared to 
the off-line can suddenly drop rapidly when the number of requests increases. 

Due to space limitations, all proofs have been eliminated from this paper. 
They can be found in [4]. 



2 The Accommodating Function 

Consider an on-line problem with a fixed amount of resources n. Let p{I) denote 
the minimum resources necessary for an optimal off-line algorithm to fully grant 
all requests from the request sequence I. We refer to I as an a-sequence^ if 
p{I) < a • n, i.e., an a-sequence is a sequence for which an optimal off-line 
algorithm could have fully granted all requests, if it had had resources a • n. 

For a maximization problem, A{I) is the value of running the on-line algo- 
rithm A on /, and OPT(I) is the maximum value that can be achieved on I by 
an optimal off-line algorithm, OPT. Note that A and OPT use the same amount 
of resources, n. 

The algorithm A is c-accommodating w.r.t. a-sequences if c < 1 and for 
every a-sequence /, A{I) > c • OPT(/) — 6, where 6 is a fixed constant for the 
given problem, and, thus, independent of /. 

Let Ca = {c \ A is c-accommodating w.r.t. a-sequences}. The accommodat- 
ing function A is defined as A{a) = supC«. 

For a minimization problem, A{I) is a cost and OPT(I) is the minimum cost 
which can be achieved. Furthermore, A is c-accommodating w.r.t. a-sequences if 
c > 1 and for every a-sequence /, A{I) < c*OPT(/) 4-6, and the accommodating 
function is defined as A{a) = infC^. 
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With this definition, the accommodating ratio from [3] is ^(1) and the com- 
petitive ratio is lima_^oo A{a). In this paper, we only consider a > 1. In figure 1, 
these relationships are depicted using a hypothetical example. 

The extra information contained in the accommodating function compared with 
the competitive ratio can be used in different ways. If the user knows that es- 
timates of required resources 
cannot be off by more than 
a factor three, for instance, 
then ^(3) is a bound for the 
problem, and thereby a bet- 
ter guarantee than the bound 
given by the competitive ra- 
tio. The shape of the function 
is also of interest. Intervals 
where the function is very 
steep are critical, since there 
earnings, compared to the op- ^ accommodating function for a max- 

. . 1 . 1 .11 imization problem, 

timal earnings, drop rapidly. ^ 



on-line 




3 Unit Price Bin Packing (UPBP) 



Consider the following bin packing problem: Let n be the number of bins, 
all of size k. Given a sequence of integer-sized objects of size at most fc, the 
objective is to maxi- 



mize the total num- 
ber of objects in these 
bins. This is a funda- 
mental problem in op- 
timization [9], which 
has been studied in 
the off-line setting, start- 
ing in [7], and its ap- 
plicability to proces- 
sor and storage allo- 
cation is discussed in 
[8]. We only consider 



on-line 

off-line 




Fig. 2; General upper and lower bounds on the accommo- 
dating function for Bin Packing. 



fair algorithms, i.e., an object can only be rejected if it cannot fit in any bin. In 
this problem for a given a, we only consider sequences which could be packed 
— an bins by an optimal fair off-line algorithm. The results proven in this 



m n 



section are summarized in Figure 2. First, upper bounds on the accommodating 
function which apply to all fair deterministic algorithms are given. 



Theorem 1. For any fair UPBP algorithm, if a < ^ and fc > 3, then A(a) < 
2 

2+(a-l)(ik-2) • 

Using a = I above, gives an upper bound on the competitive ratio. 
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Corollary 1. If k > 3, no fair UPBP algorithm is more than Q^-competitive. 

The bound on the accommodating function for a = 1 can be improved. 

Theorem 2. For k>7, any fair UPBP algorithm has an accommodating ratio 
of at most I . 

Now we give lower bounds which apply to all fair deterministic UPBP algo- 
rithms. The first lower bound is on the competitive ratio. 

Theorem 3. The competitive ratio for any fair UPBP algorithm is greater than 

2_i 

—j^, when k >3. 

Next we give a lower bound on the accommodating ratio, v4(l). 

Theorem 4. Let I be an input sequence which can be accommodated within n 
bins, that is, p{I) < n. The performance ratio is greater than | for any fair 
UPBP on-line algorithm. 

The final general lower bound is on the accommodating function. 

Theorem 5. For any fair UPBP algorithm, the accommodating function can be 
bounded by A{a) > n,ax{i+(a-i“it: 2 +(a-i)|} ’ forl<a<2, and k > max{3,n}. 

We next give an upper bound on the accommodating function for First-Fit. 
This upper bound is close to the general lower bound of Theorem 5, so it is 
almost tight. 

Theorem 6. For UPBP, First- Fit has an accommodating function of at most 
i+(„4p-T) ’ for K a <2. 

In later work [5], we show that Worst-Fit has a better competitive ratio 
{'f’WF > 2 ^) First-Fit {rpE < while First-Fit has a better accom- 
modating ratio («4 ff( 1) > |) than Worst-Fit (w4wf( 1) < The choice 

as to which algorithm to use depends on a, i.e., on the actual distribution of 
request sequences. 



4 The Unit Price Seat Reservation Problem (UPSRP) 

As mentioned earlier, the accommodating ratio was introduced in [3] in connec- 
tion with the seat reservation problem, which was originally motivated by some 
ticketing systems for trains in Europe. In this problem, a train with n seats trav- 
els from a start station to an end station, stopping at fc > 2 stations, including 
the first and last. Reservations can be made for segments between any two sta- 
tions. The passenger is given a single seat number when the ticket is purchased, 
which can be any time before departure. The algorithms (ticket agents) attempt 
to maximize income, i.e., the sum of the prices of the tickets sold. In this paper, 
we consider only the pricing policy in which all tickets have the same price, the 
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unit price problem. The ticket agent must be fair, it may not refuse a passenger if 
it is possible to accommodate him when he makes his reservation. We define the 
accommodating function A{a) for the seat reservation problem to be the ratio 
of how well a fair on-line algorithm can do compared to the optimal fair off-line 
algorithm, when an optimal fair off-line algorithm could have accommodated all 
requests if it had had an = n' seats. The accommodating function could help 
the management in determining how much benefit could be gained by adding an 
extra car to the train, given their current distribution of request sequences. 

In [3], bounds on the competitive and accommodating ratios were given. 
Below, these results are extended to the accommodating function. 

Theorem 7. -4(a) > 2 +(fcZ 27 (i- ' ^ ' ) ® lower bound for UPSRP. 

The following result for the accommodating function is very close to that for 
the competitive ratio when n' > ^. 

Theorem 8. .4(a) < 3+2(fc-2)^in(^ ' a-ij upper bound for UPSRP. 

In [3], bounds on the competitive and accommodating ratio for First-Fit and 
Best-Fit are given. Below, these are extended to the accommodating function. 

( 2 1 2 1 \ 

j on upper bound for First- 

Fit and Best-Fit on UPSRP. 

5 Other Problems 

For some on-line problems, using the accommodating function does not neces- 
sarily result in additional insight compared with what is already known from the 
competitive ratio. For example, the obvious way to define the accommodating 
ratio for the paging problem is to consider sequences of requests such that the 
optimal off-line algorithm could accommodate all requests with no swapping. 
In this case, all pages needed fit in main memory, so on-line algorithms would 
also be able to accommodate all requests with no swapping, giving an accommo- 
dating ratio of 1. The lower bound results, which show that any deterministic 
on-line algorithm has a competitive ratio of at least fc, where k is the number 
of pages in main memory, holds even if there are only A: + 1 pages in all. Thus, 
nothing further is said about how much it helps to have extra memory, unless 
one actually has enough extra memory to hold all of a program and its data. 

As an example of a very different type of problem where the accommodating 
function can be applied, we have considered a scheduling problem: the problem 
of minimizing flow time on m identical machines, where preemption is allowed. 
A job arrives at its release time and its processing time is known. 

Theorem 10. >1(1) = 1. Fora> >l(a) G l?(log„P). 

The difference between this lower bound for a > 1 and the lower bound on 
the competitive ratio from [17] is quite small: 0{log^ P) versus i?(log 2 P), i.e., 
for any fixed m, the bounds are the same. 
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Abstract. We consider the approximability of the tsp problem in graphs 
that satisfy a relaxed form of triangle inequality. More precisely, we as- 
sume that for some parameter r > 1, the distances satisfy the inequality 
dist(x, y) <T' (dist(a:, z) + dist(; 2 :, y)) for every triple of vertices x, y , and 
z. We obtain a 4r approximation and also show that for some e > 0 it is 
NP-hard to obtain a (1 + er) approximation. Our upper bound improves 
upon the earlier known ratio of (3r^/2-hr/2) [1] for all values of r > 7/3. 



Introduction, The approximability of the Traveling Salesman Problem (tsp) de- 
pends on the particular version of the problem. Specifically, if the distances are 
Euclidean, the tsp admits a polynomial time approximation scheme [2,14]. If 
the distances form a metric, then the problem is MAX-SNP-hard [16] and 3/2 is 
the best obtainable approximation ratio known [4]. For arbitrary (symmetric) 
distances, however, the problem cannot be approximated to within any factor. 
(This negative result follows by a reduction to the problem of finding a Hamil- 
tonian cycle in a graph.) 

It seems natural that slight variations in the distances of a problem should 
affect the solution to the problem only slightly, and this should hold even if a 
perturbed instance of metric TSP no longer obeys the Z\-inequality. This intu- 
ition suggests parameterizing tsp instances by how strongly they disobey the 
^-inequality. In particular, for parameter r, 

dist(a:,^) < r • [dist (a:,y) -h dist(y,^:)]. 

If r = 1, the distances satisfy the triangle inequality. For r > 1, the distances 
satisfy the relaxed triangle inequality, 

Andreae and Bandelt [1] show a 3r^/2 + r/2-approximation for the param- 
eterized triangle inequality tsp (z^t-tsp). In this paper, we provide a simple 
algorithm that obtains a 4r-approximation to the Ar-TSP, This result is opti- 
mal to within a constant factor. 

The Problem, An instance of the problem is a weighted graph G consisting 
of n vertices (representing cities). Distances between vertices satisfy the Ar- 
inequality and the length of an optimal TSP tour is denoted OPT. The objective 
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of the problem is to find (in polynomial time) a tour having length at most 
/(r)-OPT, where /(r) is a slowly-growing function of r. In this paper we present 
an algorithm having an approximation ratio of 4r. 

Intuition. Unfortunately, an approximation algorithm on a graph obeying the A- 
inequality does not automatically translate into an approximation algorithm on 
a graph obeying the the parameterized i^-inequality. To understand why, notice 
an advantage of the triangle inequality: shortcutting vertices never increases 
distances, no matter how many vertices are avoided. Thus, the tsp on a subset 
of vertices is a lower bound on the tsp of all the vertices. 

This advantage is lost with the parameterized Zl-inequality. Shortcutting 
over £ vertices may increase the distance substantially: the shortcutted distance 
may depend on the parameter Thus, the tsp on a subset of vertices may be 
dramatically larger than the tsp on all the vertices. 

We illustrate this difficulty by examining two well-known approximation al- 
gorithms for the A-tsf. A 2-approximation is obtained as follows: 

1. Find a minimum spanning tree (mst) for G. 

2. Create a multigraph by duplicating each edge in the mst. 

3. Find an Eulerian tour. 

4. Shortcut over nodes to obtain a Hamiltonian cycle. 

The Christofides heuristic [4] yields a 3/2 approximation: 

1. Find a mst for G. 

2. Identify the nodes of odd degree and find a minimum-cost perfect matching for 
these nodes. 

3. Find an Eulerian tour in the multigraph consisting of the mst and the perfect 
matching. 

4. Shortcut the Eulerian tour to obtain a Hamiltonian cycle. 

The performance guarantees follow because the mst is a lower bound on opt 
and the minimum cost perfect matching is a lower bound on opt/2. 

Z\t--tsp. Now consider what happens to these algorithms for the ^r-TSP. The 
MST remains a lower bound on opt, however the minimum cost perfect matching 
on the odd-degree vertices is no longer a lower bound on opt/2. 

The Ar-TSP algorithm of [1] finds an MST and doubles edges to create an 
Eulerian tour. Then, it obtains a Hamiltonian cycle by shortcutting. In the 
algorithm, at most 2 consecutive nodes are removed from any part of the tour, 
and thus the approximation ratio is O(r^). It is never necessary to shortcut more 
than 2 nodes, because the cube of any connected graph is Hamiltonian [19]. (The 
cube of G, denoted G^, contains edge (w,t;) if there is a path from u to v in G 
of at most 3 edges.) This approach cannot obtain an approximation better than 
O(r^), because there are instances when the mst is a factor of O(r^) smaller 
than OPT. 
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0{r) -Approximation Algorithm. In order to obtain a better approximation ra- 
tio, we begin with a tighter lower bound. A graph 5 is 2 node connected if 
the deletion of any node from S leaves it connected. Any Hamiltonian cycle 
is 2 node connected. Thus, the minimum weight 2-node-connected subgraph is 
a lower bound on OPT. Notice that it is NP-hard to find the minimum weight 
2-node-connected subgraph. However, a 2-approximation can be computed in 
polynomial time [10,18]. 

Let denote the square of graph S. (Thus, (w, v) is an edge in if there is 
a path in S between u and v using at most 2 edges.) Our algorithm is described 
below. 



TSP-APPROX (G) 

1. Find an (approximately) minimum cost 2-node-connected subgraph of G. 
Call this graph S. 

2. Find a Hamiltonian cycle in S^. 



In 1970, Fleischner proved that the square of a 2-node-connected graph is 
Hamiltonian [7, 8], and in his Ph.D. thesis Lau provided a constructive proof [11, 
12] that finds a Hamiltonian cycle in polynomial time. We note that Fleischner’s 
result is applied to the tsp in a simpler way by Parker and Rardin [17], who 
show a 2-approximation for the bottleneck-TSP. 

Theorem 1. Algorithm TSP-APPROX is a 4r approximation algorithm. 

Proof. The graph S has weight at most 2 -opt. The goal is to find a Hamiltonian 
cycle in that has low weight. (Notice that it is not sufficient for the analysis 
to bound the total weight of the edges in 5^, because the weight could be a 
factor of 0(nr) larger than the weight of the edges in S.) Thus, we need a more 
careful analysis. 

Consider any Hamiltonian cycle C in S^, and orient this cycle. Let {u,v) be 
a (directed) edge in C. We say that (u,ty) G C crosses (undirected) edges {u,v} 
and {v, w} in S if {u, w} ^ S and otherwise crosses edge {n, w} G S. 

Notice that no Hamiltonian cycle crosses any edge more than 4 times. This is 
because each node u in S is incident to exactly two edges in C, one edge entering 
u and one edge leaving u (see Figure 1(a)). This observation immediately yields 
an 8r-approximation. 

We obtain a 4r-approximation by finding a new Hamiltonian cycle C" that 
crosses every edge in S at most twice. We obtain C" by adjusting overlapping 
edges. There are 2 cases. Figure 1(a) depicts the first case, where {u, v} is crossed 
4 times. The solid lines in the figure represent edges in S. The dotted lines 
represent edges in C. Cycle C can be modified as shown in Figure 1(b), so that 
{u, v) is never crossed. 
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Fig. 1. In (a), edge {«, v} is crossed 4 times by cycle C. In (b), cycle C is modified so 
that {n, v} is never crossed. 




Fig. 2. In (a), edge {u, v} is crossed 3 times by cycle C. In (b), cycle C is modified so 
that {«, v} is crossed once. 



Case 2 is shown in Figure 2(a), where {u, v} is crossed 3 times. Cycle C can be 
modified as in Figure 2(b) so the {u^v} is crossed only once. After fixing these 2 
cases, no edge is crossed more than twice. It is easily seen that identifying the two 
cases and transforming as described above can be accomplished in polynomial 
time. 



Theorem 2. There exists some fixed constant e > 0 such that it is NP-hard to 
approximate ^r-TSP to within a factor better than (1 -f er). 

Proof We use a reduction from tsp with distances one and two which is max- 
SNP-hard [16]. The proof in [16] establishes that there exists an infinite class 
of graphs G with the following properties. Let G = (V^E) be a graph in G- 
There exists a fixed constant <5 > 0 such that either G is Hamiltonian, or every 
Hamiltonian cycle in the complete graph on V requires at least en edges from 
E = V xV -E, Further, given a graph from G it is NP-Complete to distinguish 
which of the above two cases applies. MAX-SNP-hardness of one-two tsp follows 
from this. We use the same class of graphs to obtain our result. Given a graph 
G = {V,E) from G we create an instance of ^r-TSP in the obvious way by 
making the graph complete with edges in E having cost 1, and the rest having 
cost 2r. If G is Hamiltonian the cost of an optimal TSP tour in G is n, otherwise 
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it is at least (1 - 5)n + 2rSn, Since it is NP-Complete to distinguish between 
these two cases the theorem follows. 

Open Problems. Our upper and lower bounds are within a constant factor of 
each other; however, the gap is large. Can we obtain a kr approximation for 
some k smaller than 4? One approach might be to combine the two steps in 
our algorithm in some clever way. It is natural to consider the same relaxation 
of triangle inequality for the Asymmetric TSP problem. Although an O(logn) 
approximation is known with the triangle inequality, no approximation factor of 
the form 0(r^ log n) is known for the relaxed version for any constant £. The 
relaxed triangle inequality is useful in practice (see [6, 15] for applications to 
pattern matching of images) and finding robust algorithms whose performance 
degrades only mildly with r is of interest. 
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Abstract. In the map verification problem, a robxit%; given a (possi- 
bly incorrect) map M of the world G with its position and orientation 
indicated on the map. The task is to find out whether this map, for 
the given robot position and orientation in the map, is correct for the 
world G. We consider the world model with a graph G = {Vg^Eq) in 
which, for each vertex, edges incident to the vertex are ordered cyclically 
around that vertex. This holds similarly for the map M = (Vm,Em)- 
The robot can traverse edges and enumerate edges incident on the cur- 
rent vertex, but it cannot distinguish vertices and edges from each other. 
To solve the verification problem, the robot uses a portable edge marker, 
that it can put down and pick up as needed. The robot can recognize 
the edge marker when it encounters it in G. By reducing the verifica- 
tion problem to an exploration problem, verification can be completed 
in 0(|Vbl X \Eg\) edge traversals (the mechanical cost) with the help of 
a single vertex marker which can be dropped and picked up at vertices 
of the graph world [DJMW1,DSMW2]. In this paper, we show a strategy 
that verifies a map in 0 (|Vm|) edge traversals only, using a single edge 
marker, when M is a plane embedded graph, even though G may not be 
(e.g., G may contain overpasses, tunnels, etc.). 



1 Introduction 

There are several different facets for robot navigation which are important in 
real world environments. One crucial issue is how to deal with cumulative errors, 
which may cause the robot to lose track of its position in the real world. Several 
different approaches are suggested to relate the robot position with the external 
features of the environment using a map [BCR, BRS, GMR, KB, LL, PY]. This 
leads to the task of map construction (mapping), i.e., learning the cognitive map 
from observations, as summarized by Kuipers and Levitt [KL]. Much research 
has been done on the robot mapping problem for different external environments 
[AH, DJMWl, DM, DP, DKP, Kw, PP, RS]. 

Naturally, one major class of maps used in robot navigation is geometric. 
However, it remains an important and difficult problem how to utilize the map 
and match it with the enormous amount of observed geometric information 
for robot decision makings. Alternatively, qualitative maps, such as topological 
graphs, are proposed to model robot environments which usually require much 
less information in comparison with geometric models [DJMWl, DM, DP, KB, 
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LL, RS]. This approach often focuses on a small set of characteristic locations in 
the environment and the routes between them to reduce information necessary 
for robot navigation. Consequently, it simplifies the task of robot decision mak- 
ing. Kuipers and Levitt have proposed a spatial hierarchical representation of 
environments consisting of four levels: sensorimotor level (a robot uses sensors 
to detect local features of the environment); procedural level where the robot 
applies its knowledge of the world to find its place in the world and to follow 
specified routes; topological level which describes places and their connecting 
paths, usually with the graph model; and metric level which includes necessary 
geometric information related to the topological representation [KL, also see 
KB,LL]. These approaches, neither purely metric nor purely qualitative, leave 
certain features of the environment out but keep necessary information helpful 
for robot motion planning. 

The robot’s perception of the world can also be different as a result of the 
different sensors it carries. In many situations, it is assumed that nodes or edges 
traversed previously can all be distinguished. In contrast, it is assumed in [RS] 
that nodes are divided into a small number of classes, for example, white and 
black colors, and can only be recognized as such. Dudek et al. [DJMWl] apply 
the world model introduced by Kuipers and Levitt to a specific situation in which 
no global orientation information is possible. They divide the world into places 
represented by nodes in a topological (i.e., embedded) graph and use an edge 
between two nodes to represent a connecting path between the corresponding two 
places. The robot is assumed not be able to distinguish nodes from each other 
but can recognize a special local geometric feature: a cyclic order of incident 
edges at each node. This emulates the fact that, at the crossroads, paths form a 
cyclic order because of the local planar geometric nature of the surface. Dudek 
et al. [DJMWl] show that it is impossible to learn the graph in general if the 
robot uses only information collected under the above restriction. For instance, 
this happens when every node in the graph, representing the world, has the 
same number of incident edges. On the other hand, they show that in a total of 
0{\V\ X \E\) traversals of edges, the map can be constructed with the help of a 
single marker which can be put down on nodes and picked up by the robot. Deng 
and Mirzaian [DM] showed that using \V\ identical node-markers the robot can 
construct a map of a plane embedded graph G by traversing each edge at most 
a constant number of times (i.e., a total of 0{\V\) edge traversals). 

The world model introduced by Dudek et al. will be used in our discussion 
[DJMWl]. We consider the map verification problem: The robot is given a single 
directed edge-marker and a plane embedded n-vertex map M and the initial 
position and orientation of the robot in the map. The task is to verify the 
correctness of the map in this setting. The environment graph G may or may not 
be planar (e.g., it may contain tunnels, overpasses, etc.). We demonstrate that 
the verification of plane embedded graph maps can be done very efficiently with 
a single edge marker by introducing a map verification strategy which traverses 
each edge at most 4 times. 
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2 The World Model 

As discussed above, the world is an undirected connected graph G = {V,E) 
embedded with no edge crossings on a (not necessarily planar) surface [DJMWl]. 
At each node, the incident edges are embedded locally in the plane (Figure 1). 
The local embedding forms a natural cyclic order for the edges incident to the 
node. When we have specified an edge incident to the node as the reference edge 
for the node, we can name other edges incident to this node with respect to the 
reference edge according to this cyclic order (e.g., in clockwise order). 







Fig. 1. A cyclic order of edges incident to a node. 



The Robot’s Map of the Graph World. In the graph world model, nodes 
usually correspond to intersection of passage ways in the real world. To deal 
with general situations where landmarks may look similar by the robot’s sen- 
sors, Dudek et al. assume the worst possible situation: nodes in the graph are 
indistinguishable to the robot. Therefore, the complete map of the graph is a 
triple (V,E,S), where V and E are the node set and the edge set of the graph, 
and S is the collection of the local planar embeddings of edges incident to each 
node. Note that in Figure 2, we intentionally give an example, where, for sev- 




(a)The graph 



(b) Local embeddings of edges 



Fig. 2. A map. 



eral nodes, the cyclic orderings of incident edges are different from those in the 
topological graph to emphasize the general situation where local orientations of 
edges at each node can be arbitrary. In fact, given any graph G = {V, E) and 
given any set S of cyclic orders of edges incident to a node, there is a surface 
on which the graph can be embedded such that the local planar embedding of 
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edges incident to each node follows the cyclic order of S [HR]. Even in the real 
world, these may happen because tunnels and overpasses can create generally 
embedded surfaces. 

An alternative representation of the map is its cyclically ordered Adjacency 
List Structure. That is, for each node v in the map we associate a cyclically 
ordered list of the nodes adjacent to v. The cyclical ordering is the clockwise 
ordering of the corresponding incident edges around node v. 

Robot Navigation with a Correct Map. Once the correctness of the map 
is guaranteed, the current location of the robot is matched to a node uq on the 
map, and a path from the robot location is matched to an edge eo incident to uq 
on the map, the robot can match all the paths at its current location to edges 
on the map incident to uq. If the robot moves to another location through one of 
the paths, it knows which map node matches the next location it reaches. The 
edge that leads the robot to the next location becomes its reference edge at the 
new location. From the local embedding of edges incident to the new node, with 
the help of the reference edge, it can again match edges incident to this new 
node with paths from its new location. This allows the robot to use the map in 
navigating from one location to another until it reaches its destination. 

The Map Verification Problem. The robot is initially positioned at a certain 
node of an unknown embedded graph environment G and oriented along one of 
its incident edges. It is also given an embedded map M (say, in the form of the 
cyclically ordered adjacency list structure). We let the augmented map M to 
mean the map M plus the initial position and orientation of the robot on the 
map. Let the augmented G be defined similarly. The map verification problem is: 
for an augmented pair (M, G), verify whether M is a correct augmented map of 
the augmented environment G by navigating in the environment and comparing 
the local observations with the given map. 

3 The Plane Map Verification Algorithm 

A notion of face tracing from topological graph theory [GT] is crucial in our 
verification algorithm. The general idea in our algorithm is to trace the faces of 
the augmented map M one by one, mimic the same actions on the environment 
graph, and compare the local observations with the map. For the world model 
as discussed above, the local observations are the degree of the node visited, and 
the presence or absence of a marker at the current node or edge. The intricacy of 
this approach is refiected by the fact that the sequence of these local observations 
for any proper subset of faces is not at all enough even for a partial map but the 
complete sequence uniquely verifies the map. 

Our algorithm uses a single directed edge-marker. When we refer to the map 
M, we will use phrases such as to put down or to pick up the edge-marker or to 
move along an edge. The actual action is carried out on the graph environment 
G, but a corresponding symbolic operation is performed on M . 
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ALGORITHM 1: We have an augmented pair {M,G) and a directed edge- 
marker. M is a given planar embedded map represented, say, by its cyclically 
ordered adjacency list structure, and G is the yet unknown augmented environ- 
ment graph. The cases when M has a single node or edge can be handled and 
proved trivially. Now consider the general case. 

Assume that the robot is initially positioned at node rio of M and oriented 
along edge Cq = {rio^ni). Place an edge-marker on edge Co in the direction of 
robot’s orientation. In general, suppose the robot is currently at node rii and 
is oriented along edge ei = (n^, 71^4.1). Record the degree of node rii and let us 
denote that by di. The robot moves to node nf+i. Now let Ci+i = {rii^i, 711^2) be 
the edge clockwise next to edge ei at node ni+i. Increment i by one and repeat 
the same process. The robot stops before iteration, say, /, when the directed edge 
6/ = (n/,n/4.i) it is about to traverse contains the edge-marker in the direction 
of the traversal. 

At that point, the robot picks up the marker which appears on edge e = (u, u) 
directed from u to u and it is picked up when we are about to move from u to 
V again. The robot has completed tracing a face, say, / of M. Let V{f) = 
(do,di, • • • ,d/) denote the node degree sequence the robot observed during the 
tracing of face /. In our map verification algorithm we will use 'D(f) as the 
signature for face /. Note that the signature of a single face by itself does not 
uniquely identify the topological structure of the face. We will show, however, 
that the collection of all signatures together will. 

The robot then backtracks along the most recent edges traversed (during 
forward tracing of faces) until it reaches the first edge, say, e of M that it has 
traversed only once. (Note that during the backtracking, we consult map M, not 
G, to figure out edge e.) The robot then uses edge e as the starting edge of the 
next face to trace. It places the edge-marker on that edge in the direction not 
yet traversed (according to map M) and follows the same face tracing procedure 
described above. 

To help the backtracking process, the robot can stack up all its forward 
movements by pushing an appropriate id of the edge of M just traversed onto a 
stack of edges. Then, during the backtracking, it pops the top id from the stack 
and performs the reverse move. If during the backtracking the stack becomes 
empty, the robot terminates its traversal: all faces of M have been traced. 

To complete the description of our map verification algorithm we should 
add that the robot mimics the same edge tracing actions on the embedded 
environment graph G as it performed on the map M (e.g., taking the next edge 
clockwise). If ever during the process it notices a mismatch of local observations 
between M and G, it immediately halts and declares the augmented map is 
incorrect. The local observations that the robot matches are during the forward 
movements (tracing faces) and that is observing and matching the degree of the 
current node and the presence or absence of the edge-marker on the current edge 
and its direction on that edge. 

We now summarize the algorithm described above with the following notes. 

- M is the map and G is the graph-like world. 
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- All robot actions take place “mentally” on M and physically on G. 

- Node degrees are measured in G, and counted on M at the same time. A 
clockwise order is assumed available at each node of G and M. 

- As the robot moves, it checks whether its perception in G agrees with its 
expectation based on M. If a discrepancy occurs, then verification fails. 

- Discrepancy of type 1: node degree observed on G not the same as expected 
based on M. 

- Discrepancy of type 2: edge-marker not found where expected (or found 
where not expected) based on M. 

- For each edge e of M, trace{e) indicates how many times has edge e been 
traversed in either direction during the forward face tracing traversals. 

Main algorithm: 

Set ES (edge stack) to empty. 

For each (undirected) edge e in M, set trace{e) 0. 

Select edge eo = (^ 0 ,^ 1 ) out of initial node tiq. 

Place edge-marker on eo in the direction no ni. 

ForwardTraverseFace F of M defined by the directed edge-marker. 

Loop until ES is empty. 

Pop ES to obtain edge e = {ni,rii^i). 
if trace{e) = 2 then Backtrack along edge e. 
else /* trace{e) = 1 */ 

Place the edge marker on e in the direction not yet traversed nj+i -> n^. 
ForwardTraverseFace defined by the directed edge-marker. 

end Loop 
Procedure: 

ForwardTraverseFace defined by edge e = (n<,n<+i) and direction dir = (n^ 

Repeat 

Verify degree of current node of M against G (type 1). 

Traverse edge e in the direction dir. 

Push e, in the direction of traversal, on ES. 
set trace{e) trace{e) -f 1. 

Select the next edge ei out of the current node 
(clockwise wrt to the previous edge traversed), 
set e 4- ei and update dir accordingly. 

Verify whether presence/absence of edge-marker on e agrees on 
M and G (type 2). 
until edge-marker is detected on e. 
pick up the edge-marker, 
end ForwardTraverseFace 



Examples: To appreciate some of the subtleties of this seemingly simple algo- 
rithm, we present several illustrative examples. 
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Example 1: In this example we consider only the actions taken by Algo- 
rithm 1 regarding the map. Figure 3 shows a map M with 3 faces, indicated 
by fi through fs in the order they are traced by the algorithm. The edge 
indicated by a directed arrow and labeled mi is the position and orientation 
of the marker at the starting edge of face fi. The robot tracing face f\ vis- 
its edges {ab,bc,cd,de,ec^cb,ba,ag^gh^hi,igjga), with the signature V{fi) = 
(2, 2, 3, 2, 2, 3, 2, 2, 3, 2, 2, 3, 2). Then it picks up the marker and backtracks along 
edge ag. Positions the marker along edge gi and starts tracing the second 
face /2 = {gi^ih^hg) with signature T>{f 2 ) = (3, 2, 2, 3). Picks up the marker 
and backtracks along edges {gh, hi, ig, gi, ih, hg, ga, ab, be) and places the marker 
along edge ce. It then starts tracing the third face /s = (ce, ed, dc) with signa- 
ture (3, 2, 2, 3). It then backtracks along edges {cd, de, ec, ce, ed, dc, cb, ba). At this 
point the stack is empty. The process is complete. 




Fig. 3. Tracing faces of M by the algorithm. 



Example 2: In this example M is an incorrect augmented map of G as shown 
in Figure 4. The arrows indicate the starting edge of each face. Algorithm 1 does 
not notice the difference between M and G until the fourth face of the map is 
traversed. 





Fig. 4. An incorrect map. 



Example 3: We can simulate the effect of an edge-marker by two distinct 
node-markers, placing them at the two ends of the edge, or three homogeneous 
node-markers by placing one at one end and two at the other end. However, in 
this example we show that using a single node-marker which is placed at the 
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starting node of each face does not work correctly. In Figure 5 we see a situation 
where we have an incorrect map, but the robot will not detect the error. The 
arrows indicate starting edges of each face. The node-marker is placed at the 
start of the corresponding starting edge. While tracing face /i of M, the robot 
will double-trace the “outer” face of G. Also, while tracing faces /2 and fs of 
M, the robot will trace the same portion of the “inner” face of G. It will never 
visit the two leaf nodes in G and their incident edges. 




Fig. 5. A single node-marker is not enough. 



Example 4: Algorithm 1 does not work when the map is not embedded in 
the plane. For instance, consider the example in Figure 6. The augmented Map 
M has the single face with signature 2, 2, 3, 3, 3, 2, 2, 3, 3, 3, 2. The corresponding 
face of G has the same signature. Map M in this example can be embedded on a 
torus, and has embedded-genus 1. Algorithm 1 will fail to observe any mismatch. 




4 Proof of Correctness of Algorithm 1 for Plane Maps 

We will show Algorithm 1 of the previous section works correctly when the given 
map M is embedded in the plane (even though the environment graph G may 
not be). We will also show that the total number of edges traversed by the robot 
during the algorithm (forward and backtracking included) is at most 4 times the 
total number of edges of the map. 

Theorem 1. Algorithm 1 of the previous section correctly verifies any aug- 
mented plane embedded map using a single directed edge-marker. Furthermore, 
the total number of edges traversed by the robot is at most 4 times the number 
of edges in the map. 
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Proof. Suppose M is an n-node augmented map and G is the augmented envi- 
ronment graph. It is not hard to see that each edge of the map will be traversed 
at most 4 times: Each edge is forwardly traversed twice, and with each such 
move the robot pushes an id of the edge onto the stack. Each edge backtracked 
corresponds to popping a previously pushed id from the stack. 

Now let us consider the correctness of the algorithm. If the algorithm finds 
a mismatch, then clearly the returned answer that M is not a valid augmented 
map is correct. Now suppose the algorithm returns the answer that M is a correct 
augmented map. We will prove the correctness of the answer by induction on 
the number of faces of M. 

Basis: In this case M has only one face and since it is embedded in the plane, 
it must be a tree. However, G may be any embedded graph. Suppose to the 
contrary that M is not a correct augmented map of G, but the algorithm makes 
the incorrect conclusion. Consider the smallest counter-example, that is, one in 
which M has fewest possible number of nodes. We will reach a contradiction 
by showing that there is even a smaller counter-example. Prom the description 
of the algorithm, M is not a single node or edge. Since each edge of M is a 
bridge (i.e., an edge whose removal will disconnect the graph), each edge of M 
will be traversed twice (in opposite directions) during the tracing of the single 
face of M. In other words, the length of the face of M, that is the number of 
edges incident to the face, is 2n - 2. While tracing the face of M, suppose the 
robot is tracing a face / of G. Since it will check the presence or absence of the 
edge-marker along edges, it will make exactly one complete round around face 
/ of G. Hence, length of / is also 2n — 2. 

Since M is a tree, it must contain a leaf node other than the two ends of the 
starting edge. Let x be the first such leaf of M that the robot reaches during 
the tracing of the face of M. When it reaches node x of M, suppose the robot 
is at node y of G. Since the degrees match, y must be a leaf of G, and the 
unique nodes adjacent to x in M and to y in G must also have equal degrees. 
As the robot continues the tracing of the face of M, it will encounter the leaf 
node X only once. Similarly, since it traces face / of G exactly one round, it will 
encounter leaf node y of G only once. Now consider the smaller counter-example 
pair (M',G') in which G' is the same as G with leaf node y and its incident 
edge removed, and M’ is M with leaf node x and its incident edge removed. 
Furthermore, consider the same initial position-orientation of the robot. When 
the algorithm is applied to (M', G') it will observe the same signature and hence 
return the incorrect answer that M' is a correct augmented map of G'. This 
contradicts the minimality of M. 

Induction: In this case M has at least two faces. Suppose fi and /2 are re- 
spectively the first and second faces of M traced by the algorithm. Suppose 
Ci = (t;i,Ui), z = 1,2, is the starting directed edge of face fi. Let /i, = {vi,Ui)^ 

z = 1, 2, be the corresponding objects of G. Because of the use of the edge-marker, 
length of fi is the same as length of /i, z = 1, 2, and they have a matching signa- 
ture. Because of the backtracking process, edge C 2 is the first edge, in opposite 
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direction, incident to face fi that was traversed only once during the tracing of 
fi. Hence, faces fi and /2 are incident to edge 62. Thus, 62 is not a bridge of 
M. Therefore, faces fi and /2 of M are distinct and share edge 62. 

As it makes the backtracking in M, the robot makes the same number of 
backtracking moves to trace back from edge ei to edge 62. Now we show that 
62 cannot be a bridge of G. At the start of tracing of the second face, the edge- 
marker is placed at the starting edges 62 and 62 of faces /2 and /2 of M and G, 
respectively. Since 62 is not a bridge of M, the edge-marker on 62 will be seen 
only once at the start and once at the end of the tracing of /2. Therefore, the 
same must hold in G, that is, the edge-marker on 62 will be seen only once at the 
start and once at the end of the tracing of face /2- We conclude that 62 cannot 
be a bridge of G either. If it were, then the edge-marker would have been seen 
once more, in the reverse direction, in the middle of the tracing of /2. Therefore, 
faces fi and /2 are distinct and share the edge 62- 

Figure 7 (a) shows the situation in M. Now we will perform a local surgery 
on M to construct a map M' with one fewer face than M. Simply remove edge 
62 from M. This will reduce the degrees of nodes V2 and U2 by 1. The surgery is 
depicted in Figure 7 (b). This surgery effectively merges the two faces fi and /2 
into a new face denoted by /i,2. Similarly, let G' be obtained from G be removing 
edge 62. This merges the two faces fi and /2 into a new face /i,2. Now consider 
the instance (M', G') with the same initial position-orientation of the robot as in 
the instance (M, G). Since fi and fi had the same length and the same signature, 
for z = 1, 2, the new faces /i,2 and /i,2 also have the same length and signature. 
Furthermore, the remaining faces, if any, traced by the robot will be exactly as 
before the surgery (except that end nodes of 62 and 62 have reduced degrees). 
Thus, if the instance (M',G') was the input, the robot would still answer that 
M' is a correct augmented map of G'. Since M' has one fewer face than M, 
by the induction hypothesis, M' is indeed a correct map of G'. Furthermore, 
since before the surgery we had matching signatures and matching backtracking 
length, in (M', G') the pair of node V2 and U2 will respectively match the pair of 
nodes V2 and U2. Therefore, M is a correct map of G. This completes the proof. 





Fig. 7. (a) Before the surgery, (b) after the surgery. 
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5 Implementation 

We have an animated simulation program for our verification algorithm for the 
case of plane embedded maps. The program is written in C embedded in an 
OpenGL/GLUT program to produce the graphics. The program runs on a Sun 
Ultra 2 running Solaris version 2.5.1. The program was tested successfully on 
a variety of simple map/graph pairs. The animation shows the progress of the 
(simulated) robot on the graph and its corresponding state and position on the 
map. A snapshot of the animation is shown in Figure 8. 




Fig. 8. The animation snapshot. 



6 Remarks and Discussion 

In the paper we presented an efficient algorithm to verify a plane embedded 
map of a graph world by a robot, with the aid of one edge markers. We assumed 
that the robot is told its initial position on the map. If the robot is not told 
this information, then the problem is one of self-location [DJMW2]: the robot is 
required to locate itself on the map, while verifying the map at the same time. 
It is an open question whether this self-location problem in a planar graph-like 
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world can be solved in 0{\V\ -h |E|) traversals on the edges by a robot with a 
single edge marker. 
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The authors would like to thank Arlene Ripsman for producing the animated 
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Abstract. We consider the on-line navigation problem of a robot inside 
an unknown polygon P. The robot has to find a path from a starting 
point to an unknown goal point and it is equipped with on-board cameras 
through which it can get the visibility map of its immediate surroundings. 
It is known that if P is a street with respect to two points s and t then 
starting at s the robot can find t with a constant competitive ratio. In 
this paper we consider the case where the robot is inside a rectilinear 
street but looks for an arbitrary goal point g instead of t. Furthermore, 
it may start at some point different firom s. We show that in both cases 
a constant competitive ratio can be achieved and establish lower bounds 
for this ratio. If the robot starts at s, then our lower and upper bound 
match, that is, our algorithm is optimal. 



1 Introduction 

Navigation among obstacles is one of the basic abilities required for autonomous 
mobile robots. One fundamental navigation problem is to plan a path from a 
given starting point s to a goal point g. This task is often further complicated 
by the fact that the environment of the robot may not be known completely 
in advance. It is usually assumed in this case that the robot is equipped with 
sensors that supply it with visual information about its local neighborhood. It 
can access the visibility polygon vis{p) of its position and it recognizes the goal 
when it sees it. 

Since the robot has incomplete information but, nevertheless, has to act im- 
mediately, its strategy can be considered as an on-line algorithm. A natural way 
to judge the performance of the robot is to compare the distance travelled by 
the robot to the length of the optimal path from s to g. In other words, the 
robot’s path is compared with that of an adversary who knows the environment 
completely. The ratio of the distance travelled by the robot to the optimal dis- 
tance from 5 to ^ is called the competitive ratio achieved by the robot’s strategy. 
If this ratio is bounded by c, the strategy is called c- competitive. In particular, 
if c is a constant, the strategy is simply called competitive. Competitive on-line 
searching has been considered in many settings such as searching among rectan- 
gles [BBFY94,BBF+96,BRS91,CL93,MI94,PY91], convex polygons [KP93], and 
on the real line [BYCR93,ELW93,Gal80]. 

Klein [Kle91] was the first to consider the problem of searching for a goal 
inside an unknown polygon. In arbitrary polygons with n vertices there is a 
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lower bound of n/2 — 1 on the competitive ratio, so no competitive strategy is 
possible. Klein gives a competitive strategy for a special class of polygons called 
streets. In this class, the points s and g are located on the polygon boundary and 
two chains from 5 to ^ in the clockwise and counterclockwise direction (called 
L- and i?-chain, resp.) are mutually weakly visible. Klein’s original strategy for 
searching streets achieves a competitive ratio of < 5.73 and he proves a lower 
bound of \/2 for the competitive ratio of searching streets. Recently, optimal \/2- 
competitive algorithms have been devised independently by Icking, Klein, and 
Langetepe [IKL99], and Schuierer and Semrau [SS99]. Other classes of polygons 
have also been shown to admit competitive strategies [Kle94,LOS97,Sch97]. 




Stronger results can 
be obtained for most nav- 
igation problems if the 
environment is rectilin- 
ear. Deng, Kameda, and 
Papadimitriou have de- 
vised an algorithm that 
competitively explores an 
unknown rectilinear poly- 

^ ^ gon completely [DKP981. 

Fig. 1. A robot searches for an arbitrary goal path of the robot 

point g inside a rectilinear street (P, h , * 2 ). jg compared to the shortest 

watchman tour. An algorithm for searching in general streets devised by 
Kleinberg [Kle94] achieves a competitive ratio of y/2 if the street is rectilinear. 
This is optimal since the lower bound of y/2 also holds for rectilinear streets. 
Classes of polygons that admit competitive search strategies in the rectilinear 
case include ^-streets [DI94], HV-streets, and 0-streets [DHS95]. 

Note that the definition of a street strongly depends on the two points s and 
g, so only a triple (P,s,g) is called a street. In contrast, we call a polygon P 
streetifiahle if there are two points ti and t ‘2 such that (P, ^ 1 ,^ 2 ) is a street, i.e. 
the boundary chains L and R w.r.t. ti and ^2 are mutually weakly visible. 

In all previously proposed strategies to search in a streetifiahle polygon P the 
robot can only start at a point ti and search for a point ^2 such that (P,ti,t 2 ) 
is a street. Much more than the structural constraints of streets it is this very 
restricted choice of the location of the starting point and the goal that limits 
the applicability of street searching strategies in practice. Whereas it may be 
reasonable to assume that the robot starts its search at a favourable location, 
such as ti, it is very unlikely that the goal — sometimes also called the hider — is 
placed at ^ 2 - This gives rise to two natural questions. 



- Is there a competitive strategy to search in a street (P,ti,t 2 ) starting at 
s = ti for an arbitrary goal gl Although it can be shown that every path 
from t\ to t ‘2 sees the whole polygon, the point g may become visible too 
late and the previously proposed algorithms which search for t ‘2 are not 
competitive in this setting. 
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- Is there a competitive strategy to search in a streetifiable polygon? Here, 
the robot may start anywhere inside a streetifiable polygon and looks for an 
arbitrary goal. 

Note that we do not assume in the second problem that the robot knows the 
points ti or t ‘2 nor that it is able to recognize them when they are seen. We 
only require the existence of t\ and ^ 2 - The second problem has also been called 
position-independent searching and has been investigated for star-shaped poly- 
gons [LOS97]. Note also that in the first problem the restriction of starting at 
ti is not as severe as it may seem. Of course, since we may exchange the label 
of ti and t 2 , we can also start at t^- In fact, there may be many possible choices 
for t\. For instance, it can be shown that in a star-shaped polygon P every 
boundary point p is a possible choice for ti, that is, there is a point in P such 
that (P,p,t 2 ) is a street. Hence, a strategy to search for an arbritrary goal in a 
street starting at t\ is also a position-independent search strategy in star-shaped 
polygons. In this way, our results can be viewed as a generalization of [LOS97]. 

In this paper, we provide answers to the above questions if the polygon 
is rectilinear. We present a simple strategy to search in a street (P,ti,t 2 ) for 
an arbitrarily located goal if we start at t\. It achieves a competitive ratio of 
\/2\Jl -f \pi « 2.61 in the metric. We also provide a matching lower bound 
showing that our strategy is optimal. Secondly, we present a competitive strategy 
to search in a streetifiable polygon P. The starting point and the goal can now 
be located everywhere inside P. It achives a constant competitive ratio that is 
less than 59.91 in the L\ metric. 

The rest of this paper is organized as follows. In section 2 we introduce 
some definitions and preliminary lemmas. In section 3 we present the strategy 
Complete-Search for searching for a goal g starting at 5 = ti. Section 4 
describes the strategy for searching in rectilinear streetifiable polygons. Lower 
bounds for both cases are presented in section 5. Due to the limited space most 
of the proofs are omitted. 

2 Preliminaries 

Definition 1 ([Kle91]) Let P be a simple polygon with two distinguished 
boundary points, ti and t^- Let L and R denote the boundary chains from t\ 
to t 2 - Then (P,ti,t 2 ) is called a street iff L and R are mutually weakly visible, 
i.e. if each point of L can be seen from, at least one point of R, and vice versa. 

Note that an equivalent definition for streets is that all points in P are weakly 
visible from every path from ti to t 2 [Kle91]. 

We denote the interior of P by I. Vertices of P where the internal angle 
is greater than tt are called reflex. Line segments inside P that intersect the 
boundary at both end points are called chords. A chord is called strict if it 
intersects the boundary in exactly two points. A strict chord divides the polygon 
into exactly two regions, general chords may create three or more regions. The 
explored region of a chord c is the region containing the starting point s and 
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denoted by expl{c)] the union of the other regions is called the unexplored region 
of c and denoted by unexpl{c). Similar to the regions of a chord the upper left 
region w.r.t. a point p can be defined. The segments from p to its projections 
upwards and to the left form the border of this region. The upper right, lower 
left and lower right region w.r.t. a point are defined analogously. 

From now on we only consider rectilinear polygons, that is, polygons whose 
edges are axes-parallel. The robot is defined by its position, which will be denoted 
by p in the following, and its search direction. The minimal chord that contains 
p and is orthogonal to the search direction is denoted by Cp. Note that Cp can 
degenerate into a point if p is located on a horizontal boundary edge. 

In the following we always rotate the scene such that the search direction is 
north. The maximal horizontal resp. vertical chord containing p is denoted by 
resp. Cp. Note that 3 Cp. The path followed by the robot up to position p is 
denoted by Vp, and its path between two points pi and p 2 is denoted by Vp^^p ^ . 
For a point g in P, a path V is Li-extensible to g if P can be extended to an 
Li -shortest path to g. The boundary of the visibility polygon of a point p inside 
P consists of parts of the boundary of P and other line segments which we call 
windows. The region of a window that is not visible from p is called a pocket. 
The maximal chord that contains a window w always contains p. The end point 
of w closer to p is called a landm,ark. We define two special kinds of landmarks 
that are important for our strategies, see Figure 2 for examples. 

Definition 2 Consider a landm,ark I inside P with two incident edges e and e' 
such that e is invisible from, p (except for the point 1). I is called an h-landmark 
(resp. v-landmark) with respect to p iff the m,axim,al chord Cg that contains e 
intersects Cp (resp. Cp).^ Cg is then called the chord induced byl. e is called the 
hidden edge of 1. I is called significant iffp is in the explored region of Ce‘^ 




Fig. 2. (a) h and h are significant v-landmarks. (b) h and h are h-landmarks. (c) h 
is no h- or v-landmark, Iq is a significant v-landmark, /y is an insignificant v-landmark. 



Lemma 1 Let I be a significant h- or v-landm,ark w.r.t. p. If s — ti, then t 2 is 
in the unexplored region of the induced chord ofl. 

^ In Section 3 we additionally require h- and v-landmarks to be in unexpl{cp). 

^ Note that h-landmarks are always significant if they are contained in unexpl(cp). 
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Landmarks that are neither h- nor v-landmarks are not important because 
they are hidden by other landmarks, as shown by the following lemma. 

Lemma 2 If a quadrant Q w.r.t. p does not contain an h- or v-landmark, then 
it does not contain a landmark at all 

3 Starting at ti and searching for g 

In this section we consider the case that the environment is a rectilinear street 
(P, We present an optimal search strategy for the case that the robot 

starts at ti and searches for a goal g anywhere inside P. The robot detects g 
when it is visible, but it does not need to recognize ^ 2 . In this section we only 
call a landmark I an h- or v-landmark w.r.t. p if Z is in unexpl{cp). 

3.1 Strategy COMPLETE- SEARCH 

The strategy proceeds in phases. At the beginning and end of each phase the 
search direction is axes-parallel. For convenience we always rotate the scene such 
that the search direction is north. If g becomes visible, the robot moves to it. 

Let 5 == ti be an interior point of a segment. The initial search direction of the 
robot is orthogonal to this segment. If s is a vertex, then a simple analysis yields 
the initial direction. The robot immediately does a case analysis. In the case 
analysis all landmarks are landmarks w.r.t. p and the current search direction. 

Case 1 There are no h-landmarks. 

We need the following two lemmas for the description of this case. 

Lemma 3 If no significant landmark is visible from, an end point p of a phase, 
then g is visible from the path from s to p. 

Lemma 4 At an end point p of a phase significant v-landmarks are not on both 
sides of Cp. 

The lemmas also hold for p = s. Thus, we may assume that at least one 
significant v-landmark is visible and all v-landmarks are on one side of c^. The 
robot turns right or left depending on the position of the v-landmarks. It does 
not move and starts a new phase, see Figure 3 (1). 




Fig. 3. Simple examples of the three cases of Strategy Complete-Search. 
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Case 2 There are h-landm,arks, and contains no significant v-landmarks. 

The robot maintains its direction and moves north until it encounters the first 
induced chord of an h-landmark, see Figure 3 (2). 

Case 3 There are h-landmarks, and Cp contains a significant v-landmark ly . 

This is the interesting case because the location of g is not constrained to one 
region of Cp or Cp. Let Ih be the topmost h-landmark. Assume that ly is to the 
left of p and is the leftmost landmark on c^. According to Lemma 4 all other 
significant v-landmarks are on the same side of Cp as ly. 

The robot moves diagonally to the left, at an angle of 45° to the horizontal 
axis. It stops when it either intersects the induced chord of ly or the induced 
chord of Ih- This happens because ly and l^ intersect due to the following Lemma. 

Lemma 5 If Ih is an h-landmark and ly is a significant v-landm.ark, then the 
induced chords of Ih and ly intersect. 

If the path of the robot intersects the chord induced by ly (resp. Ih), then 
it turns left (resp. right) by 45° — and the scene is rotated again such that the 
search direction becomes north. It then checks whether an h-landmark w.r.t. its 
current direction and position is visible. In the case shown in Figure 3 (3) this 
would be V^. If so, this landmark replaces the role of ly (resp. Ih), the robot turns 
back, and the diagonal path is continued. Otherwise, the robot turns right (resp. 
left) 90° and starts a new phase. 

To ensure that the strategy misses no landmarks in this phase we state the 
following Lemma. 

Lemma 6 At the tim,e that the robot checks for h-landmarks after reaching ly 
(resp. Ih) in Case 3 no significant v-landm,ark is located in the region to the left 
(resp. to the right) of p. 

3.2 Correctness and Analysis 

In this section we show that Strategy Complete-Search is correct and analyse 
its competitive ratio. First we prove the following invariants. 

Invariant T Before g becom,es visible the search direction always points into the 
unexplored region of Cp and the path from, s to p is Li-extensible to t> 2 - 

Invariant S At the start of each phase g is in the unexplored region of Cp. 

Theorem 7 Strategy Complete- Search is correct, i.e. the goal g is found. 

Proof: Invariant T shows that Vp is Li-extensible to ^2 until g becomes visible. 
Case 1 does not occur consecutively. In Cases 2 and 3 the robot moves at least 
a small constant distance on an Li -shortest path to t 2 - Since the distance from 
s to t ‘2 is finite. Invariant S shows that the goal g will eventually be found. □ 
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Lemma 8 Assume the robot sees g for the first time at p. If the segment pg is 
not axes-parallel then either g is visible from s or there is another location for g 
such that it is found later and the competitive ratio only increases. 

Since the competitive ratio is the maximum ratio for all locations of g we 
can assume that when the robot sees 5^ at p the segment pg is axes-parallel. 

Theorem 9 COMPLETE- SEARCH achieves a competitive ratio ofy/2y/2 -f- y/2 « 
2.61 in the L 2 m.etric for searching for an arbitrary goal inside a rectilinear street 
when starting at ti . 

Proof (Sketch): 

We can show that whenever Case 1 
applies, the path of the robot is L\- 
extensible to g. Thus, we need only 
analyse a path V where Case 2 and 3 
apply alternately. We show that until 
g becomes visible V is xy-monotone — 
say to the northwest. Furthermore, if 
V contains a northbound segment, g 
is above that segment, and g is to the 
left of any westbound segment. The 
worst case for the competitive ratio is as shown in Figure 4. The robot wal ks a 
distance of a-h\/5H-l-l-6, whereas the optimal path has length -f 1)^ -I- b‘^. By 
setting a = 0 and a = tt/S — 22.5° the competitive ratio achieves its maximum 
of V2V2 + V2, as claimed. □ 

4 Searching in streetifiable polygons 

We now generalize Strategy Complete- Search for the case s ^ ti, i = 1,2. 
This case is considerably more difficult and yields an algorithm for searching for 
a goal g inside an arbitrary rectilinear 

Strategy Streetifiable-Search 
consists of three states A, B, and C. 

For each state there is a corresponding 
condition that holds when the strat- 
egy enters that state. The strategy 
never enters a state twice, see Fig- 
ure 5, so we call the time that the 
strategy is in State x the Phase x. 

In State A the robot is caught in a side alley of the street and tries to get into 
the middle. Then in State B it has reached a point where at least two landmarks 
in opposite directions are visible that hide ti and t^- The robot uses four paths 
for the search that are explored in a fixed cyclic order. Each path is assigned a 
quadrant Q w.r.t. the starting point of Phase B. The distance walked on each 



streetifiable polygon. 




Fig. 5. State diagram of the strategy. 
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Fig. 6. Examples of cases where Condition A, B, or C holds. 



path increases exponentially in every step, an approach similar to the m-ray 
search described by Baeza-Yates et al [BYCR93]. They show that for the case 
of four straight rays this four way search achieves a constant competitive ratio 
that is less than 19.97. When one of the search paths recognizes a specific region 
5 of P and a point q such that {S,q,U) is a street for i = 1 or z = 2, two of 
the search paths enter State C. One path applies Complete- Search to <S, the 
second explores the region close to the entry of the street. The remaining two 
paths may enter State C later, too, if the same condition applies. 

4.1 Strategy Streetifiable- SEARCH 

We first state the three conditions that hold when the strategy enters the corre- 
sponding states. At a position p the robot identifies the set of h- and v-landmarks 
{li} and their induced chords {ci}. 

Condition A The set of all horizontal induced chords as well as the set of all 
vertical induced chords is located on one side of p. 

Condition B Condition A does not hold, i.e. there are two v-landmarks (resp. 
h-landm.arks) II and Ir such that p is between the intersection points of cl and 
cr with Cp (resp. Cp).^ 

Exactly one of the Conditions A and B applies at s. The strategy starts in 
the corresponding State A or B. The third condition may be encountered at 
some point pc in Phase B. 

Condition C There is a landm,ark I with a horizontal induced chord ci that is 
contained in the current search quadrant Q. There is an edge e on Cp such that 
e intersects ci and e hecom.es visible to the robot for the first time at pc- 

State A — One way search In Phase A the robot walks straight or diagonally 
towards the nearest h- or v-landmarks until Condition B holds. 

A.l. There are only h-landm,arks (resp. v-landm,arks) w.r.t. p. 

The robot moves towards the nearest induced chord of an h-landmark 
(resp. v-landmark) until another case applies. 

^ We include the case that p is located on either chord cl or cr, here II ot I r is 
considered a landmark although its hidden edge aligns with Cp (resp. Cp). 
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A. 2 . There are both h-landmarks and v-landmarks w.r.t. p. 

Let Cfi be the nearest horizontal induced chord and Cy be the nearest 
vertical induced chord. The robot walks diagonally in the direction of Ch 
and Cy until another case applies. 

A.3. Condition B holds. 

The strategy enters State B. 

If no h- or v-landmark w.r.t. p exists at all, then g must be visible due to 
Lemma 2. 



State B — Four way search Condition B holds at the starting point pB of 
this phase, so assume w.l.o.g. that there are two induced chords cl and cr of 
v-landmarks such that p is between them. One of ti and t 2 is to the left of cl and 
the other is to the right of cr, say t\ is to the right of cr. The robot starts four 
paths and searches them cyclically. The distance walked on each path increases 
exponentially each time it is visited. Each path explores one quadrant w.r.t. ps- 
The strategy applied by the search paths is as follows. 

Assume the search path explores the upper right quadrant Q w.r.t. pb- The 
other cases are symmetrical. The robot maintains a vertical chord Cy that is 
always to the right of p. Initially we set Cy = cr. The search direction of the 
robot is northeast or east, in Cases B.2 and B.4 it may also be north or south 
for a limited amount of time. At the beginning the direction is northeast. While 
moving the robot checks for the following cases. 

B.l. Cp aligns with a horizontal edge e that is above I and at least partially 
between p and Cy . 

The direction changes to east. The direction changes back to northeast if 
either Cy or the right end point of e are reached. 

B.2. A landm,ark I becomes visible to the right of Cy. Its induced chord ci is 
vertical and the upper end point of ci is in Q. 

If / is a v-landmark, then Cy is replaced by ci and the path continues. 
Otherwise if I is below Cp then the robot moves downwards until I becomes 
a v-landmark and does the same. If I is above Cp and no v-landmark, it is 
ignored. 

B.3. There is an h-landmark I to the right of Cy in Q whose induced chord ci 
intersects Cp but not the minimal vertical chord through p. 

Condition C applies, two of the search paths enter State C as described 
below. 

B.4. The robot arrives at Cy and Case B.3 does not hold. 

(a) There is an h-landmark I in Q that is above and to the right of Cy . 
The robot moves diagonally northeast until it reaches the induced 
chord Cl of L If the robot hits the boundary or intersects I the 
robot walks north until it reaches ci or can walk diagonally again. At 
Cl it repeats the case analysis. 

(b) There is no such h-landm,ark. 

The robot moves upwards until it hits the boundary and stops. 
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State C — Complete search and diagonal seeirch Due to Condition C 
there is a horizontal induced chord c/ that is contained completely in the search 
quadrant Q. Assume again that Q is the upper right quadrant w.r.t. pB- Let 
Pc be the point where the search path enters State C. There is an edge e on 
such that e intersects q and I is to the right of e. ti is in the region of c/ that 
does not contain ps because otherwise the hidden edge of I sees only one of the 
chains R and L. 

The robot walks to the intersection point qo^e and c/ . It then applies Strategy 
Complete-Search to the region S of ci that does not contain pc- We show 
that {S,q,ti) is a street. The robot walks until it sees g or there are no further 
landmarks. In the latter case the path ends. 

Since ti is in S the path that searches the lower right quadrant w.r.t. pb is 
obsolete. We use this path to search inside Q. When the robot proceeds with 
this path, it walks to pc and applies the following different strategy. It walks 
diagonally right and towards c/ until intersects the right end point pi of c/. 
The following cases may occur. 

C.l. p intersects ci. 

The robot walks right until it reaches the right end point pi of ci . 

C.2. Cp aligns with a vertical edge that is at least partially between p and ci. 
The robot walks straight towards c/ until it can move diagonally again or 
reaches pi . 

This concludes the description of Strategy Streetifiable- Search. We can 
show the following results. 

Theorem 10 Strategy Streetifiable-Search is correct, i.e. g is found. 

Theorem 11 Strategy Streetifiable-Search achieves a constant competi- 
tive ratio for searching in rectilinear streetifiable polygons that is at m.ost 59.91 
in the L\ m,etric and at m,ost 84.73 in the L 2 m,etric. 



5 Lower Bounds 

5.1 Starting at ti and searching for g 

We show a lower bound for searching a street completely starting at 5 = ti . The 
lower bound proves that the strategy Complete- Search is indeed optimal. 

Theorem 12 No strategy for searching for an arbitrary goal g inside a street 
starting at ti can have a competitive ratio less than c* := \/2\/2 -h ^2 « 2.61. 

Consider the polygon P in Figure 7. F is a street and the goal g may be 
placed in any of the small dents to the left of and above s. The angle a between 
the dashed lines and the axes is tt/ 8 = 22.5°. Suppose there is a strategy S for 
completely searching a street starting at s that achieves a competitive ratio of 
c* — e. Let V be the path generated by the strategy. 
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Consider the ith horizontal and the ith vertical dent as seen from s. Let hi 
and Vi denote the induced chords of the two corresponding landmarks. Let k> I 
be the distance between s and ho resp. vq- We examine the behaviour of S inside 
the region bounded by P, / iq , and vq. Qi (resp. Vi) is the point in the corner of 
the ith horizontal (resp. vertical) dent. The distance between qo and Cy and also 
between vq and Ch is k taxi a = {y/2 — 1 )k. 

The path V intersects both ho and vq. It cannot do so in their intersection 
point po because in this case the competitive ratio to reach g^o or ro is at least 
c*. Thus, w.l.o.g. V first intersects vq at a point Pv strictly below po and later 
intersects /iq at a point ph^ Ph is located strictly to the right of po because 
otherwise qq cannot be reached with the assumed competitive ratio of c* — e. 
Consider the next dent at the bottom of P. Choosing S sufficiently small, we can 
show the following Lemma. 



Lemma 13 V intersects v\ before reaching ho- 

We now look at the next dent on the right side and the next horizontal chord 
hi induced by its landmark. 



Lemma 14 After m,eeting ph the path V intersects hi before it reaches vo again. 



We have now reached 
a situation similar to the 
beginning of the proof — 

V first intersects vi at 
some point p'^ and later 
intersects hi at some 
point p'f^. We replace the 
constant k with k 8. 
Since Lemmas 13 and 14 
hold for every k> 1, we 
deduce by induction that 

V intersects Vn+i before 
intersecting hn and that 
it does not intersect vo 
between ph and its inter- 




Fig. 7. The lower bound. 



section with hn- Thus, V first travels to meet Vn+i and then goes back and 
intersects ho for the first time at ph- If we choose n large enough qo cannot 
be reached with the assumed competitive ratio. We have thus contradicted our 
assumption that there is a strategy that achieves a competitive ratio strictly 
smaller than c* . This completes the proof of Theorem 12. 



5.2 Searching in street iflable polygons 

If we choose a polygon that resembles a diagonal staircase from ti to t ‘2 then 
searching in P is equivalent to searching on a line, yielding a lower bound of 9. 
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Abstract. We study the problem of scheduling a set of n independent 
tasks on a fixed number of parallel processors, where the execution time 
of a task is a function of the subset of processors assigned to the task. 
We propose a fully polynomial approximation scheme that for any fixed 
e > 0 finds a preemptive schedule of length at most (1 + e) times the 
optimum in 0(n) time. We also discuss the non-preemptive variant of the 
problem, and present a polynomial approximation scheme that computes 
an approximate solution of any fixed accuracy in linear time. In terms 
of the running time, this linear complexity bound gives a substantial 
improvement of the best previously known polynomial bound [5]. 



1 Introduction 

In classical scheduling theory, each task is processed by only one processor at a 
time. However recently, due to the rapid development of parallel computer sys- 
tems, new theoretical approaches have emerged to model scheduling on parallel 
architectures. One of these is scheduling multiprocessor tasks, see e.g. [3,6,7]. 

In this paper we address some multiprocessor scheduling problems, where 
a set of n tasks has to be executed by m processors such that each processor 
can work on at most one task at a time and a task can (or may need to be) 
processed simultaneously by several processors. In the dedicated variant of this 
model, each task requires the simultaneous use of a prespecified set of proces- 
sors. A generalization of the dedicated variant allows tasks to have a number of 
alternative modes, where each processing mode specifies a subset of processors 
and the task’s execution time on that particular processor set. This problem is 
called general multiprocessor task scheduling. 

Depending on the model, tasks can be preempted or not. In the non-preemptive 
model, a task once started has to be completed without interruption. In the pre- 
emptive model, each task can be interrupted any time at no cost and restarted 
later possibly on a different set of processors. The objective of the scheduling 
problems discussed in this paper is to minimize the makespan, i.e. the maximum 
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completion time Cmax- The dedicated and general variants of non-preemptive 
(preemptive) scheduling for independent multiprocessor tasks on a fixed num- 
ber of processors are denoted by Pm\fixj\Cmax {P'iT^\fiXj,pmtn\Cmax) and 
Pm\setj\Cmax {Pm\setj,pmtn\Cmax)j respectively. 

Regarding the complexity, problems PS\fixj\Cmax and P3\setj\Cmax are 
strongly NP-hard [9], thus they do not have fully polynomial approximation 
schemes, unless P=NP. Recently, Amoura et al. [1] developed a polynomial time 
approximation scheme for Pm\fiXj\Cmax • For the general problem Pm\setj\Cmax , 
Bianco et al. [2] presented an approximation algorithm whose approximation 
ratio is bounded by m. Later Chen and Lee [4] improved their algorithm by 
achieving an approximation ratio y + e. Until very recently, this was the best 
approximation result for the problem, and it was not known whether there is 
a polynomial-time approximation scheme or even a polynomial-time approx- 
imation algorithm with an absolute constant approximation guarantee. Inde- 
pendently from our work presented here, Chen and Miranda [5] have recently 
proposed a polynomial-time approximation scheme for the problem. The run- 
ning time of their approximation scheme is where Xm,e = 

(2jm,e + ^)Bmm and jm,e < {SmBm -f with Bm < m\ denoting the 

mth Bell number. In this paper, we propose another polynomial time approx- 
imation scheme for Pm|setj|Cmax that computes an e-approximate solution in 
0{n) time for any fixed positive accuracy e. This gives a substantial improve- 
ment - in terms of the running time - of the previously mentioned result [5] , and 
also answers an open question of the latter paper by providing an approach that 
is not based on dynamic programming. 

It is known that the preemptive variant Pm\setj^pmtn\Cmax of the problem 
can be solved in polynomial time [2] by formulating it as a linear program with 
n constraints and variables and computing an optimal solution by using 
any polynomial-time linear programming algorithm. Even though (for any fixed 
m), the running time in this approach is polynomial in n, the degree of this 
polynomial depends linearly on m. Therefore it is natural to ask whether there 
are more efficient algorithms for Pm\setj^pmtn\Crriax (of running time, say for 
instance, 0{n) or O(n^) with an absolute constant c) that compute exact or 
approximate solutions. In this paper we focus on approximate solutions and 
present a fully polynomial approximation scheme for Pm|sety pmtnjCmax that 
finds an e-approximate solution in 0{n) time for any fixed positive accuracy e. 
This result also shows that as long as approximate solutions (of any fixed positive 
accuracy) are concerned, linear running time can be achieved for the problem; 
but leaves open the question (which also partly motivated this work) whether 
there exists a linear-time algorithm for computing an exact optimal solution of 
Pm\setj^pmtn\Cmax- This question was answered in an affirmative way for the 
dedicated variant Pm\fixj^pmtn\Cmax of the problem by Amoura et al. [1]. 
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2 Preemptive Scheduling 

In this section we consider the preemptive version Pm\pmtn,setj\Cmax of the 
general multiprocessor task scheduling problem. Suppose there is given a set of 
tasks T = {To, . . . ,Tn-i} and a set of processors M = m}. Each task 

Tj has an associated function tj : 2^ -> Q'^ U {+oo} that gives the execution 
time tj{S) of task Tj in terms of the set of processors S C M that is assigned 
to Tj. Given a set Sj of processors assigned to task Tj, the processors in Sj are 
required to execute task Tj in union, i.e. they all have to start processing task Tj 
at some starting time Tj, and complete or interrupt it at Tj + tj{Sj). A feasible 
preemptive schedule can also be defined as a system of interval sequences {Iji == 
[dji,bji), . . . , Ijs- — [djsj 5 bjsj)}, corresponding to the uninterrupted processing 
phases of task Tj, j = 0, . . . , n — 1, where bjq < for every j and q, and in 

addition there is an associated set Sjq C M of processors for each interval Ijq 
such that Sjq ^ Sj^q-^i for every j and q (if bjq = aj^q^i), and for each time step 
r there is no processor assigned to more than one task, i.e. Sjq D Sj>qf = 0 for 
every r such that r G Ijq D Ij>q>, j j'. The objective is to compute a feasible 
preemptive schedule that minimizes the overall makespan Cmax = max{bjsj : 
j = 0,...,n — 1}. First in Section 2.1, we formulate this problem as a linear 
program, and then based on this linear programming formulation we give a 
linear time approximation scheme for Pm\pmtn,setj\Cmax in Section 2.2. 

2.1 Linear Programming Formulation 

Let C C T, and assume that \C\ = k. Later C will be selected such that it will 
contain a “small” number k = const (m) of “long” tasks. Let P\ — M, P2 = 
{2, . . . ,m},P3 = {1,3, .. . ,m }, . . . ,P2m_i = {m},P2m = 0 be a fixed order of 
all subsets of M. Since preemptions are allowed, every feasible (and therefore 
also every optimal) schedule can be modified such that it remains feasible with 
the same makespan T and in addition it has the following property. The time 
interval [0,T) can be partitioned into 2'^ consecutive subintervals /(Pi) = [^o = 
0,^1), /(P2) = [^1,^2), • • • ,/(P2"») = [t 2 ^-i,t 2 rr^ = T) such that during I{Pi),i = 
1, . . . ,2"^, tasks in C are processed by exactly the processors in P^ C M. Each 
processor in Pi is working on one of the tasks from C, and all the other processors 
are either idle or executing tasks from S. (See Figure 1; notice that the lengths 
ti^i —ti of the intervals are in general different and determined in the LP below.) 

For tasks from C and processors in Pi, i = 1 , ... ,2^ — 1, we consider as- 
signments fi : Pi C. Let Ai denote the set containing all of them. Let 
Fi = M \ Pi he the set of free processors for executing tasks from S during 
the interval /(Pi). Notice, that Pi = 0 and P2m = M. For processor sets Fi, 
i = 2,. . . , 2^, let PFi.qi Q = I5 • • • 5 denote the different partitions of Fi, and 
let Vi = {PFi,h • • • 5 Finally, to simplify the notation below, we introduce 

the following indicators for every fi C M and every Ppi,g ^ Pi? i = 2,. . . ,2'^: 
^Fi.qilP) = 1, if // G PFi,q, and 0 otherwise. 

The following variables will be used in the linear program: 
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to fi t2 tz t4 is te tj fs 




Fig. 1. Schedule with processors 1, 2, 3. 



tf. the starting (and finishing) time of the interval when exactly the processors 
in Pi+i, 2 = 0, . . . , 2"^ - 1 {Pi, i = 1, . . . , 2"^) are used for processing long 
tasks from C. 

Uf\ the total processing time corresponding to the assignment f e A{^ i = 

Zjii the fraction of task Tj € C processed in the interval I{Pi) = i = 

xpi.q- the total processing time for partition Ppi,q ^'Pii 9 = 1, . . . , n*, i = 2, . . . , 2"*, 
where exactly processors of are executing short tasks and each subset of 
processors /x G Pfi,q executes at most one short task at each time step in 
parallel. 

: the total processing time for all tasks in 5 = T \ C executed on processor 
set /X. 

yj^: the fraction of task Tj G S executed by the processor set /x C M, ^ ^ 0. 
(By introducing these variables we can handle the possibility of non-fixed 
processor assignments also for the tasks in 5. Later, for the non-preemptive 
version of the problem, the |/j^’s will be 0 - 1 variables.) 

We consider the following linear program. 

Minimize t 2 m 



(0) 


to — 0, 






(1) 


ti ^ ti—i , 






(2) 


ti ti—i - 


- YlfeAi i — 1,. . ■ 


,2--l, 


(3) 


Uf > 0, 




-1, 


(4) 


S/iCPj H 


ti{p) - 


Vje£, i = l,. 


(5) 


Ei=i 


i = 1, Vj € C, 




(6) 


o' 

Al 


Vie£, i = l,...,2"* 


-1, 


(7) 


x—yrii 


q 1 , i — ‘2j . . 


om 


(8) 


/L^i=2 2^q= 


.l^Fi,qin) ■XF,,q > , 


V/x C M, ^ 0, 
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(9) XFi,q>0, i==2,...,2^, q = l,..,,rii, 

( 19 ) Y^Tjes ‘ yj^^ ~ — “^5 ^ 05 

(11) S/iCM,/z^0 yj^^ — ^ 

(12) 2/j> > 0, VT,- G 5, V/z C M, /X ^ 0. 

Constraints (0) — (1) define the endpoints of the intervals [ti,tij^i), i = 
0,1,..., 2"^ — 1. The further subdivision of these intervals corresponding to 
the different assignments in Ai is described by (2) — (3). The processing time 
of each long job can also be partitioned according to the processor sets 
i = 1, . . . , 2"^ — 1, and each of these fractions has to be covered as it is formu- 
lated by (4) — (6). The inequalities of (7) require for every set of free processors 
Fj, i = 2, . . . , 2"^, that its total processing time (which is the sum of processing 
times corresponding to the different partitions) be bounded by the length of the 
interval Furthermore, the inequalities of (8) guarantee that there is 

enough time for the execution of all /i-processor tasks in S. In (10), the total 
processing times required by short tasks using processor set /x C M, /x ^ 0, 
are expressed in terms of variables x/j>- Finally, constraints (11) and (12) for- 
mulate the possibility of non-fixed processor assignments for short tasks in S. 
Notice that solutions of the above linear program allow for each task from S 
to be executed parallel on different subsets /x of processors. Thus the schedule 
obtained directly from a solution of this linear program might contain some in- 
correctly scheduled tasks, which therefore have to be corrected afterwards. (See 
Section 2.3.) We leave the proof of the following statement for the full version 
of the paper. 

Lemma 1. In the preemptive schedule corresponding to any feasible solution of 
(0) — (12), every task Tj is processed completely. 

Let dj = miUfj^cM tj{/J>) be the minimum execution time for task Tj, and 
let D = Then, the minimum makespan OPT among all schedules 

satisfies ^ < OPT < D. By normalization (dividing all execution times by D), 
we may assume that D = 1 and that ^ < OPT < 1. This implies that for any 
optimal schedule, and for every /x C M, /x ^ 0, the total execution time of 
tasks executed by processor set /x can be bounded by 1. 



2.2 Solving the Linear Program 

Fix the length of the schedule to a constant 5 G [^, 1]. Let LP(s, A) denote the 
linear program obtained from (0) — (12) by setting t 2 m = s and replacing (8) 
and (10) with the following constraints. 

(®) Yi=2 • ^Fi,q ^ Ij Q M , /X ^ 0, 

(19) 'yj^ ~Yi=2Yq=l ^Fi,q{p>) ’ ^Fi,q /X C M, /X ^ 0. 

Note that we have eliminated the variables but bounded the correspond- 
ing lengths for the configurations by 1. Since OPT < 1, Y1=2 Y^Li ^Fi,q < 1 
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^Fi^qifJ') ' ^Fi,q < 1 (for Optimal solution). The problem 
LP{s^ A) has a special block angular structure. The blocks Bj = {yj : > 

~ 1} fo^ ^ ^ 2"^-dimensional simplicies, and the block 
= {[ti,Uf,Zji,XFi^q) : t2m = s and conditions (0) - (9)} contains only a 
constant(m) number of variables and constraints. The coupling constraints are 
the replaced linear inequalities (10). Note that for every fi C M, /i 0, the 

function = Ylr^es ' yj^^ ~ E^li ^Fi,q{^) * XF,,q -f- 1 is non-negative 
over the blocks. 

The Logarithmic Potential Price Directive Decomposition Method [8] devel- 
oped by Grigoriadis and Khachiyan for a large class of problems with block 
angular structure provides a p-relaxed decision procedure for LP(s,A). This 
procedure either determines that LP{s, 1) is infeasible, or computes (a solution 
that is nearly feasible in the sense that it is) a feasible solution of LP(s, (1 -f p)). 
This can be done (see Theorem 3 of [8]) in 2^{m + p~^ In p~^) iterations, where 
each iteration requires 0(2"^ lnln(2"^p“^)) operations and |5| H- 1 < n + 1 block 
optimizations performed to a relative accuracy of 0{p). In our case each block 
optimization over ^i,...,5|5| is the minimization of a given linear function 
over a 2"^-dimensional simplex which can be done (not only approximately, but 
even) exactly in 0(2"^) time. Furthermore, the block optimization over 
is the minimization of a linear function over a block with a constant (m) num- 
ber of variables and constraints, which clearly can be done in constant (m) time. 
Therefore for any fixed m and p > 0, the overall running time of this procedure 
for (approximately) solving LP{s,X) is 0{n). 

Lemma 2. The following assertions are true: 

(1) If LP{s, 1) is feasible and s < s' , then LP{s', 1) is feasible. 

(2) If LP{s, 1) is infeasible, then there is no schedule with makespan at most s. 

(3) LP{OPT, 1) is feasible. 

This lemma (whose proof is omitted here and left for the full version of the paper) 
implies that one can use binary search on 5 G [^,1] and obtain in O(logY) 
iterations a value s < OPT{l -f f ) such that LP(s, (1 + p)) is feasible. 



2.3 Generating a Schedule 

In this subsection, we show how to generate a feasible schedule using an approx- 
imate solution of the previously described linear program. Consider the solution 
obtained after the binary search on s. The inequalities of (10) imply that for any 
p C M, p / 0, 



2^ m 

G(/^) ■ ■ XFi,q < P- 

Tj^S i—2 7—1 

Let = Y^i= 2 Yj^=\iFi,q{l^) • ^Fi,q, the free space for small tasks that use 
processor set p. The idea is to shift a subset S C S oi small jobs to the end 
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of the schedule such that ^ Then, the subset 5 \ 5 of 

remaining small jobs fits into the free space for the //-processor tasks. 

In the following, we show how to compute such a subset S in linear time for 
any fixed m. First we modify the //-components of the solution obtained from the 
linear program. The key idea is to allow no change of processor sets // C M (or to 
have fractional //-components) for small tasks. Using this assumption it is easier 
to compute a set S and a feasible schedule for the small tasks. Furthermore, 
only a few small tasks will have fractional //-components. The //-components 
of the approximate solution of LP{s, (1 -h p)) can be considered as fractional 
assignments. Let the lengths of y be defined as 

Then for every // C M, we have p. A fractional assignment y can be 

represented by a bipartite graph G — {Vi,V2^E), where Vi and V2 correspond 
to row and column indices of y (tasks and subsets of processors), respectively, 
and (j, p) e E if and only if yj^ > 0 . 

Any assignment y represented by a bipartite graph G of lengths L^, // C M, 
can be converted in 0(n2^"^) time into another (fractional) assignment y of 
lengths at most p C M represented by a forest [ 12 ] (see Lemma 5 . 1 ). For 
a fractional assignment //, a task Tj has a non-unique processor assignment if 
there are at least two processor sets // an //', // 7^ //', such that yj^ > 0 and 
Vjn> > 0 . Let Hi be the set of tasks with non-unique processor assignments. For 
a y assignment corresponding to a forest, we have \Ui\ <2^ — 1. 

In the next step, we compute for every p C M a, subset C S \ Ui of 
tasks with yj^ = 1 for all tasks Tj G 5 ^ such that the total execution length 
there is a task Tj^ G for which ^ P' 

Let II2 — {Tj^ \p C M}. In total, we get a set Ui UW2 of tasks with cardinality at 
most 2"^+^ — 1, and a subset of tasks V = (U;xCm 5'^)\^2 of total execution length 
2 ^p. By choosing p = 47^^ 5 the total execution length of V can be bounded by 
47^ ^ jOPT. Using that s < (1 + j)OPT^ this implies the following lemma. 

Lemma 3. The objective function value of the computed linear programming 
solution restricted to T' = T\ {U\ IMA 2 ) is at most OPT + ^OPT, and \Ui \SIA 2 \ < 

2m+l _ Y, 

The next step of the algorithm computes a schedule for the tasks in T" = 
T\(WiUW 2UV). (The tasks in W1UW2UV will be scheduled afterwards at the end.) 
The makespan of the computed schedule is at most (1 + |)OPT. Furthermore, 
the total execution time for V is bounded by |OPT. Note that each small task 
Tj G T' n <S has a unique set of assigned processors. 

Let be the total processing time for all tasks in T” fl S assigned to 
processor set p, and let (t*,u*,z*,x*^y*) be a solution of the linear program 
with objective value ^2^. In the first step, accordingly to the variables 
and Zjj^ we compute a schedule for the long tasks Tj G £. As a result, during the 
interval [ti_i,t*) only the processors in Pi are used by the long tasks for every 
i = 1 , . . . , 2 "^. This can be done in constant time. 

In the second step, we schedule all //-processor tasks in S' = T" fl S for 
every //CM. From the left to the right (starting with the second interval), we 
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place the tasks of S' on the free processors in Fi = M \ Pi for each interval 
(and 2 < i < 2'^). To do this, we consider each partition Ppi^q of F{ 
with value > 0. For each set ji G we place a sequence of tasks that 

use processor set ji with total execution length Xp. If necessary, the last (and 

first) task assigned to fj, is preempted. Since ^Fi.qil^) • xp.^q > 

this procedure completely schedules all small tasks (assigned to processor set /i, 
for every // C M), and it runs in 0{n) time. Note that the computed schedule 
is feasible. 

2.4 Selecting the Cardinality of C 

The following Lemma that was proved in [11] will be used to select the constant 
A;, i.e. the cardinality of £, such that the total processing time of Ui UZY 2 can be 
bounded. We mention in passing that this lemma (or various simplified variants 
and corollaries of it) can be used in designing polynomial approximation schemes 
for other scheduling problems as well, see [1,5, 10, 11]. 

Lemma 4. Suppose d\ > (I 2 > ■ . .dn >{) is a sequence of real numbers and 
D = P 5 Q nonnegative integers, a > 0, and assume that n is 

sufficiently large (i.e. all the indices of the di’s in the statement are smaller 
than n; e.g. n > ([^]p+ 1)(^ + l)l^«^ suffices). Then, there exists an integer 
k = k{p,q,a) such that 



dk F . . . H~ ^ ol • D, 

and 

k < (g' + 1)^«^“^ +p[l + (g + 1) H- . . . + (g^ + 1)'^“^“^]. 

If g > 0, the bound on k simplifies to [{p + q){q-\- 1)^«^“^ ~p]/q^ while for g = 0, 
we obtain that k < 1 — 1). 

In our problem, we have p — 2"^+^ — l,q = 0 and oc = Therefore Lemmas 
3 and 4 imply that there exists a constant A: < 1 -f (2"^+^ — 1)(|’^] — 1) such 
that the total execution time of the tasks in Ui \JlA 2 can be bounded by ^OPT. 
Furthermore, the makespan for the partial (feasible) schedule of T\ {Ui UU 2 ) is 
at most (1 -h ^)OPT. Thus the overall makespan of the (complete) schedule is 
bounded by (1 + e)OPT. According to the arguments above, for any fixed m and 
6>0, all computations can be carried out in 0{n) time. Thus, we have proved 
the following result. 

Theorem 1. There is an algorithm which given a set of n independent tasks, 
a constant number of processors, execution times tj{p), for each task Tj and 
subset p of processors, and a fixed positive accuracy e, produces in 0{n) time a 
preemptive schedule whose makespan is at most (1 -h e)OPT. 

It is easy to check that for any fixed m, the running time of this algorithm 
depends only polynomially on hence it provides a fully polynomial approxi- 
mation scheme. 
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3 Non-Preemptive Scheduling 

In this section, we study the non-preemptive variant Pm\setj\Cmax of the general 
multiprocessor scheduling problem. Given a set Sj of processors assigned to task 
Tj^ the processors in Sj are required to execute task Tj in union and without 
preemption, i.e. they all have to start processing task Tj at some starting time 
Tj, and complete it at Tj H- tj{Sj). A feasible non-preemptive schedule consists 
of a processor assignment Sj C M and a starting time Tj > 0 for each task Tj 
such that for each time step r, there is no processor assigned to more than one 
task. The objective is to find a feasible non-preemptive schedule that minimizes 
the overall makespan Cmax = max{rj + tj(5j) : j = 0, . . . , n — 1}. 



3.1 Linear Programming Formulation 

In this section, first we consider a mixed 0 — 1 integer linear program which 
is closely related to non-preemptive multiprocessor scheduling. Similar formula- 
tions were studied in [ 1 , 10 ] for restricted (dedicated and malleable) variants of 
the problem. Based on this linear programming formulation we will give a linear 
time approximation scheme for Pm\setj\Cmax- 

Let C C T. A processor assignment for £ is a mapping / : £ -> 2^. Two 
tasks Tj and Tj/ are compatible, if /(Tj) fl /(Tj/) = 0. For a given processor 
assignment for £, a snapshot of £ is a subset of compatible tasks. A relative 
schedule of £ is a processor assignment f : C ^ 2^ along with a sequence 
M(l), . . . , M(^) of snapshots of £ such that 

— each Tj G £ occurs in a subset of consecutive snapshots M(aj), . . . , M{ujj), 
1 < (Xj < ujj < g, where M{aj) is the first and M{ujj) is the last snapshot 
that contains Tj; 

— consecutive snapshots are different, i.e. M{t) ^ M{t + 1) for 1 < t < g — 1; 

— M(l) ^ 0 and M{g) = 

A relative schedule corresponds to an order of executing the tasks in £. One 
can associate a relative schedule for each non-preemptive schedule of £ by looking 
at the schedule at every time where a task of £ starts or ends and creating a snap- 
shot right after that time step. Creating snapshots this way, M(l) 7 ^ 0, M{g) = 0 
and the number of snapshots can be bounded by max( 2 |£|, 1 ). Given a relative 
schedule R = (/, M(l), . . . , M(p)), the processor set used in snapshot M{i) is 
given by P{i) — UTGM(i) /(^)* L^t T denote the set containing (as elements) all 
the different (M\P(i)) sets, i = 1, . . . ,g, (Thus the sets in T are the different sets 
of free processors corresponding to R.) For each F G T*, let Pp^i^i = 1, . . . ,nir, 
denote the different partitions of F, and let Vp = {Pp,i, • • • -iPp.nF}' Further- 
more let be the total processing time for all tasks in 5 = T \ £ executed on 
processor set p C M. Finally, we define the indicators for every pi C M and every 
Pp,i € Ff, i = 1, . . . ,np, F G F, similarly as before: ^p,i{pi) = 1, if /i G Pp,i, 
and 0 otherwise. 
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For each relative schedule R = (/, M(l), . . . , M{g)) of £, we formulate a mixed 
0-1 integer program ILP{R), as follows. For every Tj G let pj — tj{f{Tj)). 

Minimize tg 



(0) 


to — 0, 








(1) 


^ ^i—1 7 ^ 


= 1,- 


■■,9, 




(2) 


^ujj taj—l ~ 


--Pj, 


^Tj e C, 




(3) 


Xj;P(i) = M\F 


{u-t 


■i-i) = ep, VF e 


F, 


(4) 


Er=i < 


ep, 


VFeF, 




(5) 


Y^nF 




• > F'', V/i 


in 


(6) 


XF,i > 0, VF G F 


, i 1 , . . . , Tip , 




(7) 






'iiJ.CM, fi 


7^0, 


(8) 


YlfiCM ~ 


= 1, ’ 


iTj G 5, 




(9) 


Vj^i G {0, 1}, 


Wj 


€ 5, V/i C M, /i 7^ 


0. 



The variables of ILP{R) have the following interpretation: 

ti'. the time when snapshot M{i) ends (and M{i + 1) starts), i = 1, . . . ^ g — 1. 
The starting time of the schedule and snapshot M(l) is denoted by = 0 
and the finishing time by tg. 

ep: the total time while exactly the processors in F are free. 

XFX- the total processing time for Pp,i G 'Pf, i = 1, . . . ,tif,F G P, where only 
processors of F are executing short tasks and each subset of processors Fj G 
Pf,x executes at most one short task at each time step in parallel. 
yj^: the assignment variable indicating whether task Tj G 5 is executed on pro- 
cessor set i.e. 

_ J 1, if Tj is executed by the processor set 
\ 0, otherwise. 

The given relative schedule R along with constraints (1) and (2) define a 
feasible schedule of C. In (3), the total processing times cf, for all F G T 
are determined. Clearly, these equalities can be inserted directly into (4). The 
inequalities in (4) require for every set of free processors F G F that its total 
processing time (corresponding to the different partitions) to be bounded by cf- 
Furthermore, the inequalities (5) guarantee that there is enough time for the 
execution of all tasks in S that use processor set p.. The constraints of (8) — (9) 
describe the processor assignments for the tasks in S. The equations of (7) 
express for every /l/ C M, // 7^ 0, the total processing time of all tasks in 
S that are executed by the processor set p. As before, these equations can be 
inserted directly into the inequalities of (5). 

We will use the relaxation LP{R) of ILP{R), where we replace the 0 — 1 
constraints by the inequalities yj^ > 0 for every Tj G S and p C M. Notice that 
the solutions of LP{R) allow for each task from S to be preempted or to be 
executed on different subsets p of processors. Thus there might be incorrectly 
scheduled tasks in the schedule based on the solution of LP{R). These have 
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to be corrected afterwards. It is easy to check that the approach we used in 
Section 2.2 for solving the linear program of the preemptive problem can also 
be applied to LP{R). This way one can obtain the same approximation and 
complexity bounds as those in Section 2.2. 

3.2 Generating a Schedule 

Similarly to the preemptive variant, one can also compute a subset S C S ol 
small jobs such that set <S \ 5 of remaining 

small tasks fits into the free space for the //-processor tasks. The computation of 
S can be done in an exact same manner as in Section 2 . 3 . Hence the following 
lemma also holds. 

Lemma 5. The objective function value of the best approximate solution of 
LP{R) (over all relative schedules R) restricted to T' — T \ (U.\ U ^2) *5 at 
most OPT + ^OPT, and \Ui UZ^2| < 2 "^+^ - 1 . 

The next step of the algorithm requires the computation of a pseudo-schedule 
PS(T") for the tasks in T” — T \ {U\ U ZY2 U V), in which we allow that some 
tasks from S are preempted. The makespan of the computed pseudo-schedule is 
at most OPT -h jOPT. Furthermore, the total execution time for V is at most 
jOPT. We note that each task Tj G T' fl has a unique subset of assigned 
processors. Let be the total processing time for all tasks in T" fl S assigned 
to subset pi. We schedule all //-processor tasks in S' = T" C\S for every subset 
//CM. From left to right (starting with the first snapshot M(l)), we place the 
tasks of S' on the free processors in M\P{i) for each snapshot M(i) (and 0 < / < 
g). To do this, we consider each partition Pf{i),i of F{i) — M \ P{i) with value 
^F(i) £ ^ partition Pr(i),£, we place a sequence of tasks 

that use processor set pi with total execution length If necessary, the last 

(and first) task assigned to pi is preempted. Since YlpeT ^^^=1 iF,i{p^)'^F,i > T)^ ^ 
this procedure completely schedules all tasks (assigned to processor set //) for 
every //CM, and it runs in 0 {n) time. Let W be the set of preempted (and 
therefore incorrectly scheduled )tasks in PS{T"). The following lemma gives an 
upper bound on the cardinality of >V. 

Lemma 6. |W| < 2(m — 1)A: -h 2m"^'^^. 

Lemmas 5 and 6 imply the following upper bound on the total number of 
tasks that we have to shift to the end of the schedule. 

Corollary 1. lZYiUZY2U>V| < 2 (m- l)fc-f 2 m"^+i -h 2 "^ < 2 (m- l)fc-h 3 m"^+L 

By applying Lemma 4 with values p = q = 2(m — 1) and a = ^5 we 

obtain that there exists a constant k = k{m,e) < 1 -f 4 m”^( 2 m)^’^/^ = 
such that the total execution time of the tasks in ZYi U Z^2 U W can be bounded 
by ^ < f OPT. Furthermore, the makespan for the partial (feasible) schedule 
of T \ {Ui UK2) is at most (1 -h ^)OPT. Thus, the overall makespan of the 
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(complete) schedule is bounded by (1 + e)OPT. Accordingly to the arguments 
above, for any fixed m and e > 0, all computations can be carried out in 0{n) 
time. Thus, the following result holds for Pm\setj\Cmax‘ 

Theorem 2. There is an algorithm which given a set of n independent tasks, 
a constant number of processors, execution times tj{fi), for each task Tj and 
subset pL of processors, and a fixed positive accuracy e, produces in 0{n) time a 
non-preemptive schedule whose makespan is at most (14- e)OPT. 

Acknowledgment: The authors thank P. Dell’ Olmo for helpful discussions 
on the preemptive scheduling problem studied in this paper. 
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